35/185 



ORF predicted by Cole et al . (Nature 393:537-544) and containing Rv0546c 



T / 1 
1/ 1 


















^ X / X X 


















tag tea 




cgt 


gca 


-f- 4- r-* 

uue 


gae 


gac 


get 


g L.a 


eta cee 


get 


ggt 


ggc 


aae 


tee 


y dX. 


^a -h 

gau 


tgc 


AMB ser 


gly 


arg 


axa 


pne 


asp 


asp 


axa 


vaj- 


leu pro 


axa 


gly 


gly 


asn 


ser 


asp 


asp 


cys 


£Z 1 /on 


















Qi /"^i 

^ X / O X 


















gcc gac 


gaa 




etia 


ega 


egg 


get 


gee 


ggc 


get gae 


cgc 


cgc 


gga 


age 


cgc 


ega 


gtg 


gat 


ala asp 


glu 


gly 


leu 


a rg 


arg 


axa 


axa 


gly 


ala asp 


arg 


arg 


gly 


ser 


a rg 


arg 


vax 


asp 


± / *i J. 


















151/51 


















ggt ca.c 


cgc 


cgc 


ccg 


cac 


ccg 


acc 


ggt 


gcg 


yd\, t-yc 


gcc 


teg 


ggt 


tgc 


cgt 


cgc 


cgt 


caa 


gly his 


airg 


arg 


pro 


hi s 


pro 


UIlx 


g^y 


o. X d 


asp arg 


dXd 


ser 


gly 


cys 


arg 


arg 


arg 


gin 


± O X / DX 


















911/71 

^ X X / /X 


















cgc get 


gga 


cag 


O a. L- 


egg 


tec 


eeg 


etg 


gg"^ 


caa tgc 


act 


cat 




gcg 


ccg 


caa 


ega 


aca 


airg ala 


gly 


gin 


his 


arg 


ser 


p ro 


leu 


gly 


y xii o 


thr 




ala 


ala 


pro 


gin 


a r g 


Uii X 


^ fi X / ox 


















271/91 


















get caa 


cc e 


■f- -hrr 


aae 


egg 


gte 


ccg 


gee 


t ge 


v_- y cx *. — ^> — 


t eg 


gee 


gee 


rr rr c 
y y O 


y uy 


eeg 


eta 


cgt 


ala gin 


p iro 


leu 




O. X ^^ 


val 


p r o 


ala 


cy s 




ser 


ala 


ala 


y xy 


val 


pro 


leu 


arg 


^01 /I 01 

O W X / X W J. 


















331/111 


















gat aga 


ca e 


agg 


g cc 


-h rr 


gaa 


t- r' 


etg 


gee 


dy ^y y 


a -h rr 
d L.y 


r-t- a 
o L-d 




egg 


ccg 


gcg 


gae 


t- 1- 

L-d L. 


asp ajcg 


hi s 


a rg 


axa 


itie t 


glu 


i 1 Pi 
X X c; 


leu 


alp 
dX d 


ser a r g 


met 


1 eu 


leu 


arg 


pro 


dX d 


asp 


tyr 


'^(^1 / 1 0 1 
O D X/ XZ X 


















391/131 


















cag egg 


teg 


etg 


age 


t L.C 




cgt 


gac 


cag 


acc ggg 


etg 


gcg 


a u u 


gee 


cgt 


gaa 


i- s c 

uac 


ggg 


gin axg 


S e 


leu 


ser 


phe 


tyr 


arg 


asp 


glxi 






ala 


ile 


ala 


arg 


glu 


tyr 


gly 


A Ol / 1 Al 
X/ ±H X 


















4 SI /I SI 

ri J X / X -J X 


















gcc yyc 


a ca 


gtg 


t- tt 


t tr' 




y y ^ 


a rr 


tea 


o y v_- 


gaa 


etg 


gcc 


y y u 


tac 


y y o 


rr A rr 
y dy 


ccg 


ala gly 


thr 


val 


phe 


phe 


ala 


gly 


gin 


ser 


leu leu 


glu 


leu 


ala 


gly 


tyr 


gly 


glu 


pro 


481/161 


















511/171 


















gac cat 


teg 


egg 


gga 


cet 


ttt 


ecc 


ggc 


gcg 


etg tgg 


etg 


cag 


gtg 


cgc 


gac 


etc 


gag 


get 


asp his 


ser 


arg 


gly 


pro 


phe 


pro 


gly 


ala 


leu trp 


leu 


gin 


val 


arg 


asp 


leu 


glu 


ala 


541/181 


















571/191 


















acc cag 


ace 


gag 


etg 


gte 


age 


ega 


gge 


gtg 


teg ate 


get 


cgc 


gag 


ecc 


cgc 


cgc 


gaa 


ccg 


thr gin 


thr 


glu 


leu 


val 


ser 


arg 


gly 


val 


ser ile 


ala 


arg 


glu 


pro 


arg 


arg 


glu 


pro 


601/201 


















631/211 


















tgg ggc 


etg 


cac 


gag 


atg 


cat 


gtg 


acc 


gac 


cca gac 


ggg 


ate 


aca 


etg 


ata 


ttc 


gte 


gag 


trp gly 


leu 


his 


glu 


met 


his 


val 


thr 


asp 


pro asp 


gly 


ile 


thr 


leu 


ile 


phe 


val 


glu 


661/221 


















691/231 


















gtt ecc 


gag 


ggt 


cac 


ccg 


etg 


cgt 


aca 


gac 


ace egg 


gcg 


tga 














val pro 


glu 


gly 


his 


pro 


leu 


arg 


thr 


asp 


thr arg 


ala 


OPA 















SEQ ID No.llF 
FIGURE IIP 



REPLACEMENT SHEET (RULE 26) 



36/185 



1 / T 
1/ i 


















1 / T 1 
OX/ i. X 
















Hi 


agg 


gat 


ttc 




act 




teg 


gcc 


1^ y L. d d y 


gca 


a eg 


cga 


ggt- 


ett 


cat 


gcc 


S S p 3 


arg 


asp 


phe 


ala 


thr 






ala 


^ y iZ3 ^ y O 


ala 


thr 


arg 


Y 


leu 


hi s 


ala 




















91/31 


















gac 




aag 


aga 


cag 


y yd 


age 


tga 


^y a y ^ 


cgc 


gt a 


ccg 




cgc 


cat 


tct 


asp vaX 


asp 


arg 


1 vs 


ara 


crln 


q1 V 


ser 


OPA 


OPA arg 


arcf 


val 


pro 


asp 


arg 


his 


ser 


121/4 1 


















151/51 
















gag tct 


ttc 


cga 


gtt 


cag 


caa 


caa 


teg 


aca 


cag aag 


egg 


gga 


cea 


gac 


egg 


gag 


gac 


glu ser 


phe 


arg 


val 


gin 


gin 


gin 


ser 


thr 


gin lys 


arg 


gly 


pro 


asp 


arg 


glu 


asp 


181/61 


















211/71 
















gcg gcc 


egg 


gcc 


get 


teg 


ggc 


cga 


gtg 


tct 


gag taa 


gac 


cag 


agt 


cac 


ggg 


tec 


gtg 


ala ala 


arg 


ala 


ala 


ser 


gly 


arg 


val 


ser 


glu OCH 


asp 


gin 


ser 


his 


gly 


ser 


val 


241/81 


















271/91 
















gac aac 


cgc 


gcg 


gaa 


ttc 


aat 


egg 


atg 


gcg 


ggc ggg 


acc 


gga 


ttg 


cgc 


egg 


tea 


ccg 


asp asn 


arg 


ala 


glu 


phe 


asn 


arg 


met 


ala 


gly gly 


thr 


gly 


leu 


arg 


arg 


ser 


pro 


301/101 


































aac etc 


egg 


agt 


gat 


c 


























asn leu 


arg 


ser 


asp 





























gag 
glu 

gtc 
val 

gac 
asp 

tgt 
cys 



SEQ ID N0.12A 
FIGURE 12A 



1/1 

acc gaa ggg att 
thr glu gly ile 
61/21 

aeg tag aca gga 
thr AMB thr gly 
121/41 

agt ett tee gag 
ser leu ser glu 
181/61 

egg ccc ggg ccg 
arg pro gly pro 
241/81 

aca acc gcg egg 
thr thr ala arg 
301/101 

ace tec gga gtg 
thr ser gly val 



teg cga eta act 
ser arg leu thr 

aga gac agg gaa 
arg asp arg glu 

ttc age aac aat 
phe ser asn asn 

ett egg gcc gag 
leu arg ala glu 

aat tea ate gga 
asn ser ile gly 

ate 
ile 







31/11 


egg 


cet 


gta agg 


arg 


pro 


val arg 






91/31 


get 


gat 


gac gtc 


ala 


asp 


asp val 






151/51 


cga 


cac 


aga age 


arg 


his 


arg ser 






211/71 


tgt 


ctg 


agt aag 


cys 


leu 


ser lys 






271/91 


tgg 


egg 


gcg gga 


trp 


arg 


ala gly 



caa cgc gag gtc 
gin arg glu val 

gcg tac egg acc 
ala tyr arg thr 

ggg gac cag acc 
gly asp gin thr 

acc aga gtc acg 
thr arg val thr 

ccg gat tge gcc 
pro asp cys ala 



ttc atg ccg agg 
phe met pro arg 

gcc att ctg teg 
ala ile leu ser 

ggg agg acg acg 
gly arg thr thr 

ggt ccg tgt gtg 
gly pro cys val 

ggt cac cga gga 
gly his arg gly 



SEQ ID N0.12B 
FIGURE 12B 



REPLACEMENT SHEET (RULE 26) 



37/185 



1/2 

ccg aag gga ttt 
pro lys gly phe 
61/21 

cgt aga cag gaa 
arg arg gin glu 
121/41 

gtc ttt ccg agt 
val phe pro ser 
181/61 

ggc ccg ggc cgc 
gly pro gly arg 
241/81 

caa ccg cgc gga 
gin pro arg gly 
301/101 

cct ccg gag tga 
pro pro glu OPA 



cgc gac taa etc 
arg asp OCH leu 

gag aca ggg aag 
glu thr gly lys 

tea gca aca ate 
ser ala thr ile 

ttc ggg ccg agt 
phe gly pro ser 

att caa teg gat 
lie gin ser asp 

tc 



31/11 
ggc ctg taa ggc 
gly leu OCH gly 

91/31 
ctg atg acg teg 
leu met thr ser 

151/51 
gac aca gaa gcg 
asp thr glu ala 

211/71 
gtc tga gta aga 
val OPA val arg 

271/91 
ggc ggg egg gac 
gly gly arg asp 



aac gcg agg tct 
asn ala arg ser 

cgt ace gga ccg 
arg thr gly pro 

ggg acc aga ccg 
gly thr arg pro 

cca gag tea egg 
pro glu ser arg 

egg att gcg ccg 
arg ile ala pro 



tea tgc cga gga 
ser cys arg gly 

cca ttc tgt cga 
pro phe cys arg 

gga gga cga cgc 
gly gly arg arg 

gtc cgt gtg tga 
val arg val OPA 

gtc acc gag gaa 
val thr glu glu 



SEQ ID N0.12C 



FIGURE 12C 



1/1 

GGG ATT TCG TTG CCC GAT GGA TTG TTT GTA 
gly ile ser leu pro asp gly leu phe val 
61/21 

TAT TGG CAA TGC TGG AAA TGG ACA TTC CAA 
tyr trp gin cys trp lys trp thr phe gin 

121/41 

GGG GGG CAA GCG TTT GTA CCG GGG CCA GCA 
gly gly gin ala phe val pro gly pro ala 
181/61 

ATG TTG TTG TGT CAG CGC GGG CTT GGT CTC 
met leu leu cys gin arg gly leu gly leu 
241/81 

TCA AAA CAG GTT GAA CTT AAC GAC TCA AGA 
ser lys gin val glu leu asn asp ser arg 
301/101 

GGA CAC CAA TTT GAC TCG GCT CTT TGG CAA 
gly his gin phe asp ser ala leu trp gin 
361/121 

CGC ATC GTT GGC CTT GCC ATC AAT CGC CGG 
arg ile val gly leu ala ile asn arg arg 
421/141 

GGG ACC GAC CTC GAC CAG GGG TCC TTT GTG 
gly thr asp leu asp gin gly ser phe val 
481/161 

GTC GGT CAT CGC CTA AGG CTA CCG TTC TGA 
val gly his arg leu arg leu pro phe OPA 
541/181 

GGC ACG TCA TGT CTC AGC GGC CCA CCG CCA 
gly thr ser cys leu ser gly pro pro pro 
601/201 

CAG ATG ACT CCA CGC AGC CTT GTT CGC ATC 
gin met thr pro arg ser leu val arg ile 
661/221 

CTG GTG AGC GCA CCC GCC GGC GGT CGT GCC 
leu val ser ala pro ala gly gly arg ala 



31/11 


















CGG TTT 


GGG 


AAA 


AAC 


ACT 


TGA 


AGT 


CCT 


TTT 


arg phe 


gly lys 


asn 


thr 


OPA 


ser 


pro phe 


91/31 


















TAT TGC 


GCG 


AAT 


TAA 


CCG AAC 


ACG 


GTG 


AGG 


tyr cys 


ala 


asn 


OCH 


pro 


asn 


thr 


val 


arg 


151/51 


















AGC GCC 


GCC 


GAC 


CGG 


TTG 


ACC 


GAA 


GCC 


AGC 


ser ala 


ala 


asp 


arg 


leu 


thr 


glu 


ala 


ser 


211/71 


















GAT GTC 


CCG 


GCC 


TTG 


GCT 


GGA 


CCC 


GCT 


TCT 


asp val 


pro 


ala 


leu 


ala 


gly pro 


ala 


ser 


271/91 


















ACG GAA 


ACG 


CTT 


GAA 


CCG 


CGA 


CGT 


CGC 


TCC 


thr glu 


thr 


leu glu pro 


arg 


arg 


arg 


ser 


331/111 


















TTG AAG 


GTG 


AGC 


TGC 


GAG 


CAG 


CCG 


GGT 


GAC 


leu lys 


val 


ser 


cys 


glu 


gin pro gly 


asp 


391/131 


















CTC GCG 


GAC 


GTA 


GAT 


AAT 


CAG 


CTC 


ACC 


GTT 


leu ala 


asp 


val 


asp 


asn 


gin 


leu 


thr 


val 


451/151 


















ACT GCC 


GGG 


CTT 


GAC 


GCG 


GAC 


GAC 


CAC 


AGA 


thr ala 


gly 


leu 


asp 


ala 


asp 


asp 


his 


arg 


511/171 


















CCT GGG 


GCT 


GCG 


TGG 


GCG 


CCG 


ACG 


ACG 


TGA 


pro gly 


ala 


ala 


trp 


ala 


pro 


thr 


thr 


OPA 


571/191 


















CCT CGG 


TCG 


CCG 


GCA 


GTA 


TGT 


CAG 


CAT 


GTG 


pro arg 


ser 


pro 


ala 


val 


cys 


gin 


his 


val 


631/211 


















GTT GGT 


GTC 


GTG 


GTT 


GCG 


ACG 


ACC 


TTG 


GCG 


val gly 


val 


val 


val 


ala 


thr 


thr 


leu 


ala 


691/231 


















GCG CAT 


GCG 


GAT 


C 












ala his 


ala 


asp 















SEQ ID N0.13A 
FIGURE ISA 



REPLACEMENT SHEET (RULE 26) 



38/185 



GGA TTT 


CGT 


TGC 


CCG 


ATG 


gly phe 


arg 


cys 


pro 


met 


62/21 










ATT GGC 


AAT 


GCT 


GGA 


AAT 


ile gly 


asn 


ala 


gly 


asn 


122/41 










GGG GGC 


AAG 


CGT 


TTG 


TAG 


gly gly 


lys 


arg 


leu 


tyr 


182/61 










TGT TGT 


TGT 


GTC 


AGC 


GCG 


cys cys 


cys 


val 


ser 


ala 


242/81 










CAA AAC 


AGG 


TTG 


AAC 


TTA 


gin asn 


arg 


leu 


asn 


leu 


302/101 










GAG ACC 


AAT 


TTG 


ACT 




J asp thr 


asn 


leu 


thr 


arg 


3 362/121 










fj GGA TOG 


TTG 


GCC 


TTG 


CCA 


f=i ala ser 


leu 


ala 


leu 


pro 


422/141 










,^ GGA COG 


ACC 


TCG 


ACC 


AGG 


gly pro 


thr 


ser 


thr 


arg 


4oz/1d1 










H TCG GTC 


ATC 


GCC 


TAA 


GGC 


ser val 


ile 


ala 


OCH 


gly 


□ 542/181 










T\ GGA CGT 


CAT 


GTC 


TCA 


GCG 


^== ala arg 


his 


val 


ser 


ala 


n 602/201 










AGA TGA 


CTC 


CAC 


GCA 


GCC 


t arg OPA 


leu 


his 


ala 


ala 


662/221 










TGG TGA 


GCG 


CAC 


CCG 


CCG 


trp OPA 


ala 


his 


pro 


pro 















GAT 


TGT 


TTG 


TAC 




bGA 


asp 


cys 


leu 


tyr 


gly leu 


gly 










yz/ :5l 




GGA 


CAT 


TCC 


AAT 


A.i 1 GCG 


CGA 


gly 


his 


ser 


asn 


ile ala 


arg 










i CO /CI 
±Z)Z/ O 1 




CGG 


GGC 


CAG 


CAA 






arg 


gly 


gin 


gin 


ala pro 


pro 














GGC 


TTG 


GTC 


TCG 


ATG 1 CC 


CGG 


gly 


Xeu 


va± 


ser 


met ser 


arg 










Z / Z / 3 1 
















tnr 


unr 


gin 


glu 


arg lys 


arg 










/TIT 






mmm 
X X i. 


GGC 


AAT 


TGA AGG 


TGA 


leu 


phe 




asn 


^jir/\ arg 












/ X O X 












X V^VJ 


APG 


ser 


ixe 


ai.a 


gly 


ser arg 


unr 










OZ/ XO 1 








i 1 tj 


1 o/\ 


Clo CC\j 


ooC 


gly 


pro 


leu 


OPA 


leu pro 


gly 










O ±Z/ 1 / X 




X AC 




1 1 




X OOVT 


PTG 


tyr 


arg 


ser 


asp 


leu gly 


leu 










572/191 




GCC 


CAC 


CGC 


CAC 


CTC GGT 


CGC 


ala 


his 


arg 


his 


leu gly 


arg 










632/211 




TTG 


TTC 


GCA 


TCG 


TTG GTG 


TCG 


leu 


phe 


ala 


ser 


leu val 


ser 










692/231 




GCG 


GTC 


GTG 


CCG 


CGC ATG 


CGG 


ala 


val 


val 


pro 


arg met 


arg 



SEQ ID N0.13B 
FIGURE 13B 



AAA ACA CTT GAA GTC CTT TTT 
lys thr leu glu val leu phe 

ATT AAC CGA ACA CGG TGA GGG 
ile asn arg thr arg OPA gly 

ACC GGT TGA CCG AAG CCA GCA 
thr gly OPA pro lys pro ala 

CCT TGG CTG GAC CCG CTT CTT 
pro trp leu asp pro leu leu 

TTG AAC CGC GAC GTC GCT CCG 
leu asn arg asp val ala pro 

GCT GCG AGC AGC CGG GTG ACC 
ala ala ser ser arg val thr 

TAG ATA ATC AGC TCA CCG TTG 
AMB ile ile ser ser pro leu 

TTG ACG CGG ACG ACC ACA GAG 
leu thr arg thr thr thr glu 

CGT GGG CGC CGA CGA CGT GAG 
arg gly arg arg arg arg glu 

CGG CAG TAT GTC AGC ATG TGC 
arg gin tyr val ser met cys 

TGG TTG CGA CGA CCT TGG CGC 
trp leu arg arg pro trp arg 

ATC 
ile 



REPLACEMENT SHEET (RULE 26) 



39/185 



33/11 

GAT TTC GTT GCC CGA TGG ATT GTT TGT ACG GTT TGG GAA AAA CAC TTG AAG TCC TTT TTA 
asp phe vai ala arg trp ile val cys thr val trp glu lys his leu lys ser phe leu 
63/21 93/31 

TTG GCA ATG CTG GAA ATG GAG ATT CCA ATA TTG CGC GAA TTA ACC GAA CAC GGT GAG GGG 
leu ala met leu glu met asp ile pro ile leu arg glu leu thr glu his gly glu gly 
123/41 153/51 

GGG GCA AGC GTT TGT ACC GGG GCC AGC AAG CGC CGC CGA CCG GTT GAC CGA AGC CAG CAT 
gly ala ser val cys thr gly ala ser lys arg arg arg pro val asp arg ser gin his 
183/61 213/71 

GTT GTT GTG TCA GCG CGG GCT TGG TCT CGA TGT CCC GGC CTT GGC TGG ACC CGC TTC TTC 
val val val ser ala arg ala trp ser arg cys pro gly leu gly trp thr arg phe phe 
243/81 273/91 

AAA ACA GGT TGA ACT TAA CGA CTC AAG AAC GGA AAC GCT TGA ACC GCG ACG TCG CTC CGG 
lys thr gly CPA thr OCH arg leu lys asn gly asn ala OPA thr ala thr ser leu arg 
303/101 333/111 

ACA CCA ATT TGA CTC GGC TCT TTG GCA ATT GAA GGT GAG CTG CGA GCA GCC GGG TGA CCG 
thr pro ile OPA leu gly ser leu ala ile glu gly glu leu arg ala ala gly OPA pro 
363/121 393/131 

CAT CGT TGG CCT TGC CAT CAA TCG CCG GCT CGC GGA CGT AGA TAA TCA GCT CAC CGT TGG 
his arg trp pro cys his gin ser pro ala arg gly arg arg OCH ser ala his arg trp 
423/141 453/151 

GAC CGA CCT CGA CCA GGG GTC CTT TGT GAC TGC CGG GCT TGA CGC GGA CGA CCA CAG AGT 
asp arg pro arg pro gly val leu cys asp cys arg ala OPA arg gly arg pro gin ser 
483/161 513/171 

CGG TCA TCG CCT AAG GCT ACC GTT CTG ACC TGG GGC TGC GTG GGC GCC GAC GAC GTG AGG 
arg ser ser pro lys ala thr val leu thr trp gly cys val gly ala asp asp val arg 
543/181 573/191 

CAC GTC ATG TCT CAG CGG CCC ACC GCC ACC TCG GTC GCC GGC AGT ATG TCA GCA TGT GCA 
his val met ser gin arg pro thr ala thr ser val ala gly ser met ser ala cys ala 
603/201 633/211 

GAT GAC TCC ACG CAG CCT TGT TCG CAT CGT TGG TGT CGT GGT TGC GAC GAC CTT GGC GCT 

asp asp ser thr gin pro cys ser his arg trp cys arg gly cys asp asp leu gly ala 

663/221 693/231 

GGT GAG CGC ACC CGC CGG CGG TCG TGC CGC GCA TGC GGA TC 

gly glu arg thr arg arg arg ser cys arg ala cys gly 



SEQ ID N0.13C 
FIGURE 13C 



REPLACEMENT SHEET (RULE 26) 



40/185 



part of 


the nucleotide sequence 


of seql3A 


















1/1 










31/11 


















GGG TCC 


TTT GTG ACT GCC GGG 


CTT 


GAC 


GCG 


GAC GAC 


CAC 


AGA 


GTC 


GGT 


CAT 


CGC 


CTA 


AGG 


gly ser 


phe val thr ala gly 


leu 


asp 


ala 


asp asp 


his 


arg 


val 


gly 


his 


arg 


leu 


arg 


61/21 










91/31 


















CTA CCG 


TTC TGA CCT GGG GCT 


GCG 


TGG 


GCG 


CCG ACG 


ACG 


TGA 


GGC 


ACG 


TCA 


TGT 


CTC 


AGC 


leu pro 


phe CPA pro gly ala 


ala 


trp 


ala 


pro thr 


thr 


OP A gly 


thr 


ser 


cys 


leu 


ser 


121/41 










151/51 


















GGC CCA 


CCG CCA CCT CGG TCG 


CCG 


GCA 


GTA 


TGT CAG 


CAT 


GTG 


CAG 


ATG 


ACT 


CCA 


CGC 


AGC 


gly pro pro pro pro arg ser 


pro 


ala 


val 


cys gin 


his 


val 


gin 


met 


thr 


pro 


arg 


ser 


181/61 










211/71 


















CTT GTT 


CGC ATC GTT GGT GTC 


GTG 


GTT 


GCG 


ACG ACC 


TTG 


GCG 


CTG 


GTG 


AGC 


GCA 


CCC 


GCC 


leu val 


arg ile val gly val 


val 


val 


ala 


thr thr 


leu 


ala 


leu 


val 


ser 


ala 


pro 


ala 



241/81 

GGC GGT CGT GCC GCG CAT GCG GAT C 
gly gly arg ala ala his ala asp 



SEQ ID No.l3A^ 
FIGURE 13A' 



1/1 31/11 



GGT CCT 


TTG 


TGA 


CTG 


CCG 


GGC 


TTG 


ACG 


CGG 


ACG ACC 


ACA 


GAG 


TCG 


GTC 


ATC 


GCC 


TAA 


GGC 


gly pro 


leu 


OPA 


leu 


pro 


gly 


leu 


thr 


arg 


thr thr 


thr 


glu 


ser 


val 


ile 


ala 


OCH 


gly 


61/21 


















91/31 


















TAC CGT 


TCT 


GAC 


CTG 


GGG 


CTG 


CGT 


GGG 


CGC 


CGA CGA 


CGT 


GAG 


GCA 


CGT 


CAT 


GTC 


TCA 


GCG 


tyr arg 


ser 


asp 


leu 


gly 


leu 


arg 


gly 


arg 


arg arg 


arg 


glu ala 


arg 


his 


val 


ser 


ala 


121/41 


















151/51 


















GCC CAC 


CGC 


CAC 


CTC 


GGT 


CGC 


CGG 


CAG 


TAT 


GTC AGC 


ATG 


TGC 


AGA 


TGA 


CTC 


CAC 


GCA 


GCC 


ala his 


arg 


his 


leu 


gly 


arg 


arg 


gin 


tyr 


val ser 


met 


cys 


arg 


OPA 


leu 


his 


ala 


ala 


181/61 
















211/71 


















TTG TTC 


GCA 


TCG 


TTG 


GTG 


TCG 


TGG 


TTG 


CGA 


CGA CCT 


TGG 


CGC 


TGG 


TGA 


GCG 


CAC 


CCG 


CCG 


leu phe 


ala 


ser 


leu 


val 


ser 


trp 


leu 


arg 


arg pro 


trp 


arg 


trp 


OPA 


ala 


his 


pro 


pro 



241/81 

GCG GTC GTG CCG CGC ATG CGG ATC 
ala val val pro arg met arg ile 



SEQ ID 

FIGURE 13B' 



1/1 


















31/11 


















GTC CTT 


TGT 


GAC 


TGC 


CGG 


GCT 


TGA 


CGC 


GGA 


CGA CCA 


CAG 


AGT 


CGG 


TCA 


TCG 


CCT 


AAG 


GCT 


val leu 


cys 


asp 


cys 


arg 


ala 


OPA 


arg 


gly 


arg pro 


gin 


ser 


arg 


ser 


ser 


pro 


lys 


ala 


61/21 


















91/31 
















CGG 


ACC GTT 


CTG 


ACC 


TGG 


GGC 


TGC 


GTG 


GGC 


GCC 


GAC GAC 


GTG 


AGG 


CAC 


GTC 


ATG 


TCT 


CAG 


thr val 


leu 


thr 


trp gly cys 


val 


gly 


ala 


asp asp 


val 


arg 


his 


val 


met 


ser 


gin 


arg 


121/41 


















151/51 


















CCC ACC 


GCC 


ACC 


TCG 


GTC 


GCC 


GGC 


AGT 


ATG 


TCA GCA 


TGT 


GCA 


GAT 


GAC 


TCC 


ACG 


CAG 


CCT 


pro thr 


ala 


thr 


ser 


val 


ala gly 


ser 


met 


ser ala 


cys 


ala 


asp 


asp 


ser 


thr 


gin pro 


181/61 


















211/71 


















TGT TCG 


CAT 


CGT 


TGG 


TGT 


CGT 


GGT 


TGC 


GAC 


GAC CTT 


GGC 


GCT 


GGT 


GAG 


CGC 


ACC 


CGC 


CGG 


cys ser 


his 


arg 


trp 


cys 


arg 


gly 


cys 


asp 


asp leu 


gly 


ala 


gly 


glu arg 


thr 


arg 


arg 



241/81 

CGG TCG TGC CGC GCA TGC GGA TC 
arg ser cys arg ala cys gly 

SEQ ID No,13C' 
FIGURE 13C' 



REPLACEMENT SHEET (RULE 26) 



41/185 



sequence Rvl984c predicted by Cole et al . (Nature 393:537-544) and containing 
seqlSA' 

1/1 31/11 



atg act 


cca 


cgc 


age 


ctt 


gtt 


cgc 


ate 


gtt 


ggt gtc 


gtg 


gtt 


gcg 


acg 


acc 


ttg 


gcg 


ctg 


Met thr 


pro 


arg 


ser 


leu 


val 


arg 


ile 


val 


gly val 


val 


val 


ala 


thr 


thr 


leu 


ala 


leu 


61/21 


















91/31 


















gtg age 


gca 


ccc 


gcc 


ggc 


ggt 


cgt 


gcc 


gcg 


cat gcg 


gat 


ccg 


tgt 


teg 


gac 


ate 


gcg 


gtc 


val ser 


ala 


pro 


ala 


gly 


gly 


arg 


ala 


ala 


his ala 


asp 


pro 


cys 


ser 


asp 


ile 


ala 


val 


121/41 


















151/51 


















gtt ttc 


get 


cgc 


gge 


acg 


cat 


cag 


get 


tct 


ggt ctt 


ggc 


gac 


gtc 


ggt 


gag 


gcg 


ttc 


gtc 


val phe 


ala 


arg 


gly 


thr 


his 


gin 


ala 


ser 


gly leu 


gly 


asp 


val 


gly glu 


ala 


phe 


val 


181/61 


















211/71 


















gac teg 


ctt 


ace 


teg 


caa 


gtt 


ggc 


ggg 


egg 


teg att 


ggg 


gtc 


tac 


gcg 


gtg 


aac 


tac 


cca 


asp ser 


leu 


thr 


ser 


gin 


val 


gly 


gly 


arg 


ser ile 


gly 


val 


tyr 


ala 


val 


asn 


tyr 


pro 


241/81 


















271/91 


















gca age 


gae 


gae 


tac 


cgc 


gcg 


age 


gcg 


tea 


aac ggt 


tec 


gat 


gat 


gcg 


age 


gee 


cac 


ate 


ala ser 


asp 


asp 


tyr 


arg 


ala 


ser 


ala 


ser 


asn gly 


ser 


asp 


asp 


ala 


ser 


ala 


his 


ile 


301/101 


















331/111 


















cag cgc 


ace 


gtc 


gcc 


age 


tgc 


ccg 


aac 


acc 


agg att 


gtg 


ctt 


ggt 


ggc 


tat 


teg 


cag 


ggt 


gin arg 


thr 


val 


ala 


ser 


cys 


pro 


asn 


thr 


arg ile 


val 


leu 


gly 


gly 


tyr 


ser 


"gin 


gly 


361/121 


















391/131 


















gcg acg 


gtc 


ate 


gat 


ttg 


tec 


ace 


teg 


gcg 


atg ccg 


ccc 


gcg 


gtg 


gca 


gat 


cat 


gtc 


gcc 


ala thr 


val 


ile 


asp 


leu 


ser 


thr 


ser 


ala 


met pro 


pro 


ala 


val 


ala 


asp 


his 


val 


ala 


421/141 


















451/151 


















get gte 


gee 


ctt 


ttc 


gge 


gag 


cca 


tec 


agt 


ggt ttc 


tee 


age 


atg 


ttg 


tgg 


ggc 


ggc 


ggg 


ala val 


ala 


leu 


phe 


gly 


glu 


pro 


ser 


ser 


gly phe 


ser 


ser 


met 


leu 


trp 


gly 


gly 


gly 


481/161 


















511/171 


















teg ttg 


ccg 


aca 


ate 


ggt 


ccg 


ctg 


tat 


age 


tct aag 


acc 


ata 


aac 


ttg 


tgt 


get 


ccc 


gac 


ser leu 


pro 


thr 


ile 


gly 


pro 


leu 


tyr 


ser 


ser lys 


thr 


ile 


asn 


leu 


cys 


ala 


pro 


asp 


541/181 


















571/191 


















gat cca 


ata 


tge 


acc 


gga 


ggc 


ggc 


aat 


att 


atg gcg 


cat 


gtt 


teg 


tat 


gtt 


cag 


teg 


ggg 


asp pro 


ile 


cys 


thr 


gly 


gly 


gly 


asn 


ile 


met ala 


his 


val 


ser 


tyr 


val 


gin 


ser 


gly 


601/201 


















631/211 


















atg aca 


age 


cag 


gcg 


gcg 


aca 


ttc 


gcg 


gcg 


aac agg 


etc 


gat 


cac 


gee 


gga 


tga 






met thr 


ser 


gin 


ala 


ala 


thr 


phe 


ala 


ala 


asn arg 


leu 


asp 


his 


ala 


gly 


OPA 







SEQ ID N0.13D 
FIGURE 13D 



REPLACEMENT SHEET (RULE 26) 



42/185 



SeqlSF: ORF predicted by Cole et al . (Nature 393:537-544) and containing Rvl984c 



1/1 31/11 



tga ggc 


acg 


tea 


tgt 


etc 


age 


ggc 


cca 


ccg 


cca cet 


egg 


teg 


ccg 


gca 


gta 


tgt 


cag 


cat 


OPA gly 


thr 


ser 


cys 


leu 


ser 


gly pro 


pro 


pro pro 


arg 


ser 


pro 


ala 


val 


cys 


gin 


his 


61/21 


















91/31 


















gtg cag 


atg 


act 


cca 


cgc 


age 


ctt 


gtt 


cgc 


ate gtt 


ggt 


gtc 


gtg 


gtt 


gcg 


acg 


ace 


ttg 


val gin 


met 


thr 


pro 


arg 


ser 


leu 


val 


arg 


ile val 


gly 


val 


val 


val 


ala 


thr 


thr 


leu 


121/41 


















151/51 


















gcg Gtg 


gtg 


age 


gca 


ccc 


gcc 


ggc 


ggt 


cgt 


gcc gcg 


cat 


gcg 


gat 


ccg 


tgt 


teg 


gac 


ate 


ala leu 


val 


ser 


ala 


pro 


ala 


gly 


gly 


arg 


ala ala 


his 


ala 


asp 


pro 


cys 


ser 


asp 


ile 


181/61 


















211/71 


















gcg gtc 


gtt 


ttc 


get 


cgc 


ggc 


acg 


cat 


cag 


get tct 


ggt 


ctt 


ggc 


gac 


gtc 


ggt 


gag 


gcg 


ala val 


val 


phe 


ala 


arg 


gly 


thr 


his 


gin 


ala ser 


gly 


leu 


gly 


asp 


val 


gly 


glu ■ 


ala 


241/81 


















271/91 


















ttc gtc 


gac 


teg 


ctt 


ace 


teg 


caa 


gtt 


ggc 


ggg egg 


teg 


att 


ggg 


gtc 


tac 


gcg 


gtg 


aac 


phe val 


asp 


ser 


leu 


thr 


ser 


gin 


val 


gly gly arg 


ser 


ile 


gly 


val 


tyr 


ala 


val 


asn 


301/101 


















331/111 


















tac cca 


gca 


age 


gac 


gac 


tac 


cgc 


gcg 


age 


gcg tea 


aac 


ggt 


tec 


gat 


gat 


gcg 


age 


gee 


tyr pro 


ala 


ser 


asp 


asp 


tyr 


arg 


ala 


ser 


ala ser 


asn 


gly 


ser 


asp 


asp 


ala 


ser 


ala 


361/121 


















391/131 


















cac ate 


cag 


cgc 


ace 


gtc 


gcc 


age 


tgc 


ccg 


aac ace 


agg 


att 


gtg 


ctt 


ggt 


ggc 


tat 


teg 


his lie 


gin 


arg 


thr 


val 


ala 


ser 


cys 


pro 


asn thr 


arg 


ile 


val 


leu 


gly 


gly 


tyr 


ser 


421/141 
















451/151 


















cag ggt 


gcg 


acg 


gtc 


ate 


gat 


ttg 


tee 


acc 


teg gcg 


atg 


ccg 


ccc 


gcg 


gtg 


gca 


gat 


cat 


gin gly 


ala 


thr 


val 


ile 


asp 


leu 


ser 


thr 


ser ala 


met 


pro 


pro 


ala 


val 


ala 


asp 


his 


481/161 


















511/171 


















gtc gcc 


get 


gtc 


gcc 


ctt 


ttc 


ggc 


gag 


cca 


tec agt 


ggt 


ttc 


tec 


age 


atg 


ttg 


tgg 


ggc 


val ala 


ala 


val 


ala 


leu 


phe 


gly 


glu 


pro 


ser ser 


gly phe 


ser 


ser 


met 


leu 


trp 


gly 


541/181 


















571/191 


















ggc ggg 


teg 


ttg 


ccg 


aca 


ate 


ggt 


ccg 


etg 


tat age 


tct 


aag 


ace 


ata 


aac 


ttg 


tgt 


get 


gly gly 


ser 


leu 


pro 


thr 


ile 


gly pro 


leu 


tyr ser 


ser 


lys 


thr 


ile 


asn 


leu 


cys 


ala 


601/201 


















631/211 


















ccc gac 


gat 


cca 


ata 


tgc 


ace 


gga 


ggc 


ggc 


aat att 


atg 


gcg 


cat 


gtt 


teg 


tat 


gtt 


cag 


pro asp 


asp 


pro 


ile 


cys 


thr 


gly gly gly 


asn ile 


met 


ala 


his 


val 


ser 


tyr 


val 


gin 


661/221 


















691/231 


















teg ggg 


atg 


aca 


age 


cag 


gcg 


gcg 


aca 


ttc 


gcg gcg 


aac 


agg 


etc 


gat 


cac 


gee 


gga 


tga 


ser gly 


met 


thr 


ser 


gin 


ala 


ala 


thr 


phe 


ala ala 


asn 


arg 


leu 


asp 


his 


ala 


gly 


OPA 



SEQ ID N0.13F 
FIGURE 13F 



REPLACEMENT SHEET (RULE 26) 



43/185 





















31/11 


















CCA CCG 


GGG 


CTG 


GAG 


GGG 


CGA 


ATG 


TGC 


GCC 


GAA CGC 


CGT 


CGG 


CCA 


ACT 


TGG 


CCG 


CTG 


AGG 


pro pro 


gly 


leu 


glu 


gly 


arg 


met 


cys 


ala 


glu arg 


arg 


arg 


pro 


thr 


trp 


pro 


leu 


arg 


61/21 


















91/31 


















GCG GCT 


GAT 


CCC 


CTG 


GCC 


CGA 


GAC 


GGG 


GCA 


AGC CAA 


TAG 


CGG 


CTC 


CAT 


CGG 


GCT 


TTG 


CTG 


ala ala 


asp 


pro 


leu 


ala 


arg 


asp 


gly 


ala 


ser gin 


AMB 


arg 


leu 


his 


arg 


ala 


leu 


leu 


121/41 














151/51 


















GTA GCG 


GTT 


CGG 


CGG 


GAA 


CCG 


AGC 


GCC 


GAC 


GTT GTC 


GGT 


GCC 


CGG 


TGA 


TAT 


ATT 


GGG 


TCA 


val ala 


val 


arg 


arg 


glu 


pro 


ser 


ala 


asp 


val val 


gly 


ala 


arg 


OPA 


tyr 


ile 


gly 


ser 


181/61 


















211/71 


















GAC GGG 


TAT 


GGC 


GGC 


GAC 


TGA 


GGT 


GAT 


CTG 


CGA CAC 


GCC 


GCC 


GCG 


GTG 


CTC 


GAG 


CCA 


GGC 


asp gly 


tyr 


gly 


gly 


asp 


OPA gly 


asp 


leu 


arg his 


ala 


ala 


ala 


val 


leu 


glu 


pro 


gly 


241/81 


















271/91 


















TTA CGA 


CCA 


GGG 


AAT 


TTC 


GAA 


AAT 


GTT 


ATT 


CAG AAC 


ATC 


TTG 


TAT 


CTC 


TTC 


CTC 


CGT 


GCC 


leu arg 


pro 


gly 


asn 


phe 


glu 


asn 


val 


lie 


gin asn 


lie 


leu 


tyr 


leu 


phe 


leu 


arg 


ala 


301/101 
















331/111 


















ACC CCC 


TAG 


GTG 


TAG 


TGT 


TTT 


CGA 


GTA 


CCG 


GCA GAT 


CCC 


AGT 


TCA 


CCA 


GTC 


TCA 


CCA 


GAT 


thr pro 


AMB 


val 


AMB 


cys 


phe 


arg 


val 


pro 


ala asp 


pro 


ser 


ser 


pro 


val 


ser 


pro 


asp 



C 

SEQ ID N0.14A 
FIGURE 14A 





















32/11 


















CAC CGG 


GGC 


TGG 


AGG 


GGC 


GAA 


TGT 


GCG 


CCG 


AAC GCC 


GTC 


GGC 


CAA 


CTT 


GGC 


CGC 


TGA 


GGG 


his arg 


gly 


trp 


arg 


gly glu 


cys 


ala 


pro 


asn ala 


val 


gly gin 


leu 


gly 


arg 


OPA gly 


62/21 
















92/31 


















CGG CTG 


ATC 


CCC 


TGG 


CCC 


GAG 


ACG 


GGG 


CAA 


GCC AAT 


AGC 


GGC 


TCC 


ATC 


GGG 


CTT 


TGC 


TGG 


arg leu 


ile 


pro 


trp 


pro 


glu 


thr 


gly 


gin 


ala asn 


ser 


gly 


ser 


ile 


gly 


leu 


cys 


trp 


122/41 
















152/51 


















TAG CGG 


TTC 


GGC 


GGG 


AAC 


CGA 


GCG 


CCG 


ACG 


TTG TCG 


GTG 


CCC 


GGT 


GAT 


ATA 


TTG 


GGT 


CAG 


AMB arg 


phe 


gly 


gly 


asn 


arg 


ala 


pro 


thr 


leu ser 


val 


pro 


gly 


asp 


ile 


leu 


gly 


gin 


182/61 
















212/71 


















ACG GGT 


ATG 


GCG 


GCG 


ACT 


GAG 


GTG 


ATC 


TGC 


GAC ACG 


CCG 


CCG 


CGG 


TGC 


TCG 


AGC 


CAG 


GCT 


thr gly 


met 


ala 


ala 


thr 


glu 


val 


ile 


cys 


asp thr 


pro 


pro 


arg 


cys 


ser 


ser 


gin 


ala 


242/81 


















272/91 


















TAC GAC 


CAG 


GGA 


ATT 


TCG 


AAA 


ATG 


TTA 


TTC 


AGA ACA 


TCT 


TGT 


ATC 


TCT 


TCC 


TCC 


GTG 


CCA 


tyr asp 


gin 


gly 


ile 


ser 


lys 


met 


leu 


phe 


arg thr 


ser 


cys 


ile 


ser 


ser 


ser 


val 


pro 


302/101 


















332/111 
















ATC 


CCC CCT 


AGG 


TGT 


AGT 


GTT 


TTC 


GAG 


TAC 


CGG 


CAG ATC 


CCA 


GTT 


CAC 


CAG 


TCT 


CAC 


CAG 


pro pro 


arg 


cys 


ser 


val 


phe 


glu 


tyr 


arg 


gin ile 


pro 


val 


his 


gin 


ser 


his 


gin 


ile 



SEQ ID No,14B 
FIGURE 14B 



REPLACEMENT SHEET {RULE 26) 



44/185 



33/11 



ACC GGG 


GCT 


GGA 


GGG 


GCG 


AAT 


GTG 


CGC 


CGA 


ACG CCG 


TCG 


GCC 


AAC 


TTG 


GCC 


GCT 


GAG 


GGC 


thr gly 


ala 


gly 


gly 


ala 


asn 


val 


arg 


arg 


thr pro 


ser 


ala 


asn 


leu 


ala 


ala 


glu 


gly 


63/21 


















93/31 


















GGC TGA 


TCC 


CCT 


GGC 


CCG 


AGA 


CGG 


GGC 


AAG 


CCA ATA 


GCG 


GCT 


CCA 


TCG 


GGC 


TTT 


GCT 


GGT 


gly OPA 


ser 


pro 


gly 


pro 


arg 


arg 


gly 


lys 


pro lie 


ala 


ala 


pro 


ser 


gly 


phe 


ala 


gly 


123/41 


















153/51 


















AGC GGT 


TCG 


GCG 


GGA 


ACC 


GAG 


CGC 


CGA 


CGT 


TGT CGG 


TGC 


CCG 


GTG 


ATA 


TAT 


TGG 


GTC 


AGA 


ser gly 


ser 


ala 


gly 


thr 


glu 


arg 


arg 


arg 


cys arg 


cys 


pro 


val 


lie 


tyr 


trp 


val 


arg 


183/61 
















213/71 


















GGG GTA 


TGG 


CGG 


CGA 


CTG 


AGG 


TGA 


TCT 


GCG 


ACA CGC 


CGC 


CGC 


GGT 


GCT 


CGA 


GCC 


AGG 


CTT 


arg val 


trp 


arg 


arg 


leu 


arg 


OPA 


ser 


ala 


thr arg 


arg 


arg 


gly 


ala 


arg 


ala 


arg 


leu 


243/81 
















273/91 


















ACG ACC 


AGG 


GAA 


TTT 


CGA 


AAA 


TGT 


TAT 


TCA 


GAA CAT 


CTT 


GTA 


TCT 


CTT 


CCT 


CCG 


TGC 


CAC 


thr thr 


arg 


glu 


phe 


arg 


lys 


cys 


tyr 


ser 


glu his 


leu 


val 


ser 


leu 


pro 


pro 


cys 


his 


303/101 














333/111 


















CCC CTA 


GGT 


GTA 


GTG 


TTT 


TCG 


AGT 


ACC 


GGC 


AGA TCC 


CAG 


TTC 


ACC 


AGT 


CTC 


ACC 


AGA 


TC 


pro leu 


gly 


val 


val 


phe 


ser 


ser 


thr 


gly 


arg ser 


gin 


phe 


thr 


ser 


leu 


thr 


arg 





SEQ ID N0.14C 
FIGURE 14C 



part of the nucleotide sequence of seql4A 
1/1 31/11 

TTT TCG AGT ACC GGC AGA TCC CAG GTT CAC CAG GTC TCA CCA GAT C 
phe ser ser thr gly arg ser gin val his gin val ser pro asp 



SEQ ID No,14A' 
FIGURE 14A' 



1/1 

TGT TTT CGA GTA CCG GCA GAT CCC 
cys phe arg val pro ala asp pro 



31/11 

AGG TTC ACC AGG TCT CAC CAG ATC 
arg phe thr arg ser his gin ile 



SEQ ID N0.14C 
FIGURE 14C 



1/1 ' 31/11 

GTT TTC GAG TAC CGG CAG ATC CCA GGT TCA CCA GGT CTC ACC AGA TC 
val phe glu tyr arg gin ile pro gly ser pro gly leu thr arg 



SEQ ID N0.14C' 
FIGURE 14C^ 



REPLACEMENT SHEET (RULE 26) 



45/185 



ORF predicted based on the sequence published by Cole et al, (Nature 393:537- 
544) and containing seql4A' 



1/1 


















"3 1 /II 
O 1/ 1 1 


















TAG CGG 


TTG 


GGG 


GGG 


AAG 


GTA 








rnrp r~« rp 
i 1 tj i 




C2.CC 






ATA 


TTG 


GGT 


CAG 


AMB arg 


phe 


gly 


gly 


lys 


ieu 


ala 


ala 


unr 


leu ser 


va X 


d X d 


gly 


asp 


-i 1 ^ 

J L t. 


leu 




gin 


61/21 


















y 1/ o 1 


















ACG GGT 


ATG 


GGG 


GCG 


GGT 


GAG 


GTG 


ATG 








CCCL 


ccc 








CAG 


GCT 


thr gly 


met 


ala 


ala 


ala 


glu 


val 


He 


cys 


asp thr 


pro 


pro 


arg 


cys 


ser 


ser 


gin 


d X d 


I O T / y1 1 

IZ 1/ 4 J. 


















151/51 


















TAG GAG 


GAG 


GGA 


ATT 


TCG 


AAA 


ATG 


TTA 


TTC 


AGA ACA 


TCT 


TGT 


ATG 


TCT 


TCT 


CCG 


TGC 


CAC 


tyr asp 


gin 


gly 


ile 


sar 


lys 


met 


leu 


phe 


arg thr 


ser 


cys 


ile 


ser 


ser 


pro 


cys 


his 


181/61 
















211/71 


















CGG GTA 


GGT 


GTA 


GTG 


TTT 


TCG 


AGT 


ACC 


GGC 


AGA TCG 


CAG 


GTT 


CAC 


CAG 


GTC 


TCA 


CCA 


gat 


pro leu 


gly 


val 


val 


phe 


ser 


ser 


thr 


gly 


arg ser 


gin 


val 


his 


gin 


val 


ser 


pro 


asp 


241/81 
















271/91 


















oca egg 


ggc 


gcg 


atg 


aac 


ttc 


ccg 


gca 


teg 


gca teg 


cca 


ggt 


cga 


egg 


acg 


tgg 


teg 


ege 


pro arg 


gly 


ala 


met 


asn 


phe 


pro 


ala 


ser 


ala ser 


pro 


gly 


arg 


arg 


thr 


trp 


ser 


arg 


301/101 
















331/111 


















tat gac 


ggg 


aat 


ctg 


gag 


cct 


tgt 


egg 


gee 


get eaa 


cat 


ate 


gaa 


gat 


gea 


eta 


ett 


gag 


tyr asp 


gly 


asn 


leu 


glu 


pro 


cys 


arg 


ala 


ala gin 


his 


ile 


glu 


asp 


ala 


leu 


leu 


glu 


361/121 


















391/131 


















teg ttg 


cca 


gat 


act 


gtc 


aga 


ttc 


ccg 


att 


tec gca 


aag 


gag 


egg 


tac 


gcc 


cat 


gac 


cgt 


ser leu 


pro 


asp 


pro 


val 


arg 


phe 


pro 


ile 


ser ala 


lys 


glu 


arg 


tyr 


ala 


his 


asp 


arg 



421/141 

gac egt tta eac taa 
asp arg leu his OCH 



SEQ ID No,14F 
FIGURE 14F 



REPLACEMENT SHEET (RULE 26) 



46/185 



Sequence Rv3054c predicted by Cole et al. (Nature 393:537-544) 
which may be in phase with Seql4A' 

1/1 31/11 

gtg tea gat acc aag tec gac ate aaa ate ttg gee tta gtg gga age ctg cgc geg gcg 
val ser asp thr lys ser asp ile lys ile leu ala leu val gly ser leu arg ala ala 
61/21 91/31 

teg ttc aae cgc cag ate gee gag ctg get gee aag gtc get eeg gac ggc gte acc gtc 
ser phe asn arg gin ile ala glu leu ala ala lys val ala pro asp gly val thr val 
121/41 151/51 

acc atg ttc gag ggg ctg ggg gac ctg ccg ttc tac aae gaa gac ate gac aca gcg aeg 
thr met phe glu gly leu gly asp leu pro phe tyr asn glu asp ile asp thr ala thr 
181/61 211/71 

gag gtg ccg gcg ccg gtg age geg ttg egg gag gcc gcg tet gac gcg cac get gee ttg 
qlu val pro ala pro val ser ala leu arg glu ala ala ser asp ala his ala ala leu 
241/81 271/91 

gtg gtc aeg ccg gaa tac aae ggc age att ccg gcc gtg ate aag aac gcg ate gac tgg 
val val thr pro glu tyr asn gly ser ile pro ala val ile lys asn ala ile asp trp 
301/101 331/111 

ctg tec agg cea ttc ggc gat ggc gcg ttg aag gac aag ccg ttg gcc gtg ate ggc ggc 
leu ser arg pro phe gly asp gly ala leu lys asp lys pro leu ala val ile gly gly 
361/121 391/131 

tec atg ggc cgc tac ggc ggg gta tgg gcg cac gac gag act cgc aag teg ttc age ate 
ser met gly arg tyr gly gly val trp ala his asp glu thr arg lys ser phe ser ile 
421/141 451/151 

get ggc aeg egg gtg gtc gat gcg ate aaa ctg teg gtg ccg ttc caa act ctg ggc aag 
ala gly thr arg val val asp ala ile lys leu ser val pro phe gin thr leu gly lys 
481/161 511/171 

teg gtc gcg gac gac gee ggg ctg gcg gcg aat gtg cgc gac gee gtc ggc aac ttg gcc 
ser val ala asp asp ala gly leu ala ala asn val arg asp ala val gly asn leu ala 
541/181 

get gag gtc ggc tga 
ala glu val gly OPA 



SEQ ID N0.14R 



FIGURE 14R 



REPLACEMENT SHEET (RULE 26) 



47/185 



ORF predicted by Cole et al , (Nature 393:537-544) and containing Rv3054c 
1/1 31/11 

taa cgc gat egg aat aaa teg gac cat ggt ccg gtt ggc teg tgc aag gac gtg gac caa 
OCH arg asp arg asn lys ser asp his gly pro val gly ser cys lys asp val asp gin 
61/21 51/31 

caa gcg gaa agg aac gta gca gtg tea gat acc aag tec gac ate aaa ate ttg gee tta 
gin ala glu arg asn val ala val ser asp thr lys ser asp lie lys ile leu ala leu 
121/41 151/51 

gtg gga age ctg cgc gcg gcg teg ttc aac cgc eag ate gee gag etg get gee aag gtc 
val gly ser leu arg ala ala ser phe asn arg gin ile ala glu leu ala ala lys val 
181/61 211/71 

get ccg gae ggc gtc acc gtc acc atg ttc gag ggg ctg ggg gac ctg ccg ttc tac aac 
ala pro asp gly val thr val thr met phe glu gly leu gly asp leu pro phe tyr asn 
241/81 271/91 

gaa gac ate gac aca gcg aeg gag gtg ccg gcg ccg gtg age gcg ttg egg gag gee gcg 
glu asp ile asp thr ala thr glu val pro ala pro val ser ala leu arg glu ala ala 
301/101 331/111 

tct gac gcg cac get gee ttg gtg gtc aeg ccg gaa tac aac ggc age att ccg gee gtg 
ser asp ala his ala ala leu val val thr pro glu tyr asn gly ser ile pro ala val 
361/121 391/131 

ate aag aac gcg ate gac tgg etg tee agg cea ttc ggc gat ggc gcg ttg aag gac aag 
ile lys asn ala ile asp trp leu ser arg pro phe gly asp gly ala leu lys asp lys 
421/141 451/151 

ccg ttg gee gtg ate ggc ggc tec atg ggc cgc tac ggc ggg gta tgg gcg cac gac gag 
pro leu ala val ile gly gly ser met gly arg tyr gly gly val trp ala his asp glu 
481/161 511/171 

act cgc aag teg ttc age ate get ggc aeg egg gtg gtc gat gcg ate aaa ctg teg gtg 
thr arg lys ser phe ser ile ala gly thr arg val val asp ala ile lys leu ser val 
541/181 571/191 

ccg ttc caa act ctg ggc aag teg gtc gcg gae gac gee ggg ctg gcg gcg aat gtg cgc 
pro phe gin thr leu gly lys ser val ala asp asp ala gly leu ala ala asn val arg 
601/201 631/211 
gae gee gtc ggc aac ttg gee get gag gtc ggc tga 
asp ala val gly asn leu ala ala glu val gly OPA 



SEQ ID N0.14P 



FIGURE 14P 



REPLACEMENT SHEET (RULE 26) 



48/185 



fragment based on the sequence published by Cole et al . (Nature 393:537-544) and 
containing seq 14F' and seq 14P' 

1/1 31/11 

taa cgc gat egg aat aaa teg gac cat ggt ccg gtt ggc teg tgc aag gac gtg gac caa 
OCH arg asp arg asn lys ser asp his gly pro val gly ser cys lys asp val asp gin 
asn ala ile gly ile asn arg thr met val arg leu ala arg ala arg thr trp thr asn 
thr arg ser glu OCH ile gly pro trp ser gly trp leu val gin gly arg gly pro thr 
61/21 91/31 

caa gcg gaa agg aac gta gca gtg tea gat acc aag tec gac ate aaa ate ttg gee tta 
gin ala glu arg asn val ala val ser asp thr lys ser asp ile lys ile leu ala leu 
lys arg lys gly thr AMB gin cys gin ile pro ser pro thr ser lys ser trp pro AMB 
ser gly lys glu arg ser ser val arg tyr gin val arg his gin asn leu gly leu ser 
121/41 151/51 

gtg gga age ctg cgc gcg gcg teg ttc aac cgc cag ate gee gag ctg get gee aag gtc 
val gly ser leu arg ala ala ser phe asn arg gin ile ala glu leu ala ala lys val 
trp glu ala cys ala arg arg arg ser thr ala arg ser pro ser trp leu pro arg ser 
gly lys pro ala arg gly val val gin pro pro asp arg arg ala gl^ cys gin gly arg 
181/61 211/71 

get ccg gac ggc gtc acc gtc acc atg ttc gag ggg ctg ggg gac ctg ccg ttc tac aac 
ala pro asp gly val thr val thr met phe glu gly leu gly asp leu pro phe tyr asn 
leu arg thr ala ser pro ser pro cys ser arg gly trp gly thr cys arg ser thr thr 
ser gly arg arg his arg his his val arg gly ala gly gly pro ala val leu gin arg 
241/81 271/91 

gaa gac ate gac aca gcg acg gag gtg ccg gcg ccg gtg age gcg ttg egg gag gee gcg 
glu asp ile asp thr ala thr glu val pro ala pro val ser ala leu arg glu ala ala 
lys thr ser thr gin arg arg arg cys arg arg arg OPA ala arg cys gly arg pro arg 
arg his arg his ser asp gly gly ala gly ala gly glu arg val ala gly gly arg val 
301/101 331/111 

tct gac gcg cac get gee ttg gtg gtc acg ccg gaa tac aac ggc age att ccg gee gtg 
ser asp ala his ala ala leu val val thr pro glu tyr asn gly ser ile pro ala val 
leu thr arg thr leu pro trp trp ser arg arg asn thr thr ala ala phe arg pro OPA 
OPA arg ala arg cys leu gly gly his ala gly ile gin arg gin his ser gly arg asp 
361/121 391/131 

ate aag aac gcg ate gac tgg ctg tec agg cea ttc ggc gat ggc gcg ttg aag gac aag 
ile lys asn ala ile asp trp leu ser arg pro phe gly asp gly ala leu lys asp lys 
ser arg thr arg ser thr gly cys pro gly his ser ala met ala arg OPA arg thr ser 
gin glu arg asp arg leu ala val gin ala ile arg arg trp arg val glu gly gin ala 
421/141 451/151 

ccg ttg gcc gtg ate ggc ggc tee atg ggc cgc tac ggc ggg gta tgg gcg cac gac gag 
pro leu ala val ile gly gly ser met gly arg tyr gly gly val trp ala his asp glu 
arg trp pro OPA ser ala ala pro trp ala ala thr ala gly tyr gly arg thr thr arg 
val gly arg asp arg arg leu his gly pro leu arg arg gly met gly ala arg arg asp 
481/161 511/171 

act cgc aag teg ttc age ate get ggc acg egg gtg gtc gat gcg ate aaa ctg teg gtg 
thr arg lys ser phe ser ile ala gly thr arg val val asp ala ile lys leu ser val 
leu ala ser arg ser ala ser leu ala arg gly trp ser met arg ser asn cys arg cys 
ser gin val val gin his arg trp his ala gly gly arg cys asp gin thr val gly ala 
541/181 571/191 

ccg ttc caa act ctg ggc aag teg gtc gcg gac gac gee ggg ctg gcg gcg aat gtg cgc 
pro phe gin thr leu gly lys ser val ala asp asp ala gly leu ala ala asn val arg 
arg ser lys leu trp ala ser arg ser arg thr thr pro gly trp arg arg met cys ala 
val pro asn ser gly gin val gly arg gly arg arg arg ala gly gly glu cys ala arg 
601/201 631/211 

gac gee gtc ggc aac ttg gee get gag gtc ggc tga tec ctg ggc ega ggc ggg tea gee 
asp ala val gly asn leu ala ala glu val gly OPA ser leu gly arg gly gly ser ala 
thr pro ser ala thr trp pro leu arg ser ala asp pro trp ala glu ala gly gin pro 
arg arg arg gin leu gly arg OPA gly arg leu ile pro gly pro arg arg val ser gin 
661/221 691/231 

aat age ggc tec ate ggc ttt get ggt age ggt teg gcg gga age tag egg ega cgt tgt 
asn ser gly ser ile gly phe ala gly ser gly ser ala gly ser AMB arg arg arg cys 
ile ala ala pro ser ala leu leu val ala val arg arg glu ala ser gly asp val val 
AMB arg leu his arg leu cys trp AMB arg phe gly gly lys leu ala ala thr leu ser 



SEQ ID N0.14Q 



FIGURE 14Q 
REPLACEMENT SHEET (RULE 26) 



49/185 



721/241 751/251 

egg tgg ccg gtg ata tat tgg gtc aga egg gta tgg egg egg ctg agg tga tct gcg aca 
arg trp pro val ile tyr trp val arg arg val trp arg arg leu arg CPA ser ala thr 
gly gly arg OPA tyr ile gly ser asp gly tyr gly gly gly OPA gly asp leu arg hrs 
val ala gly asp ile leu gly gin thr gly met ala ala ala glu val lie cys asp thr 
781/261 811/271 

cgc cgc cgc ggt get cga gee agg ctt acg acc agg gaa ttt cga aaa tgt tat tea gaa 
arg arg arg gly ala arg ala arg leu thr thr arg glu phe arg lys cys tyr ser glu 
ala ala ala val leu glu pro gly leu arg pro gly asn phe glu asn val lie gin asn 
pro pro arg cys ser ser gin ala tyr asp gin gly ile ser lys met leu phe arg thr 
841/281 871/291 

cat ctt gta tct ctt etc cgt gcc acc ccc tag gtg tag tgt ttt cga gta ccg gca gat 
his leu val ser leu leu arg ala thr pro AMB val AMB cys phe arg val pro ala asp 
ile leu tyr leu phe ser val pro pro pro arg cys ser val phe glu tyr arg gin ile 
ser cys ile ser ser pro cys his pro leu gly val val phe ser ser thr gly arg ser 
901/301 931/311 

ccc agg ttc acc agg tct cac cag ate cac ggg gcg cga tga act tec egg cat egg cat 
pro arg phe thr arg ser his gin ile his gly ala arg OPA thr ser arg his arg his 
pro gly ser pro gly leu thr arg ser thr gly arg asp glu leu pro gly lie gly lie 
gin val his gin val ser pro asp pro arg gly ala met asn phe pro ala ser ala ser 
961/321 991/331 

cgc cag gtc gac gga cgt ggt cgc get atg acg gga ate tgg age ctt gtc ggg ccg etc 
arg gin val asp gly arg gly arg ala met thr gly ile trp ser leu val gly pro leu 
ala arg ser thr asp val val ala leu OPA arg glu ser gly ala leu ser gly arg ser 
pro gly arg arg thr trp ser arg tyr asp gly asn leu glu pro cys arg ala ala gin 
1021/341 1051/351 

aae ata teg aag atg cac tac ttg agt cgt tgc cag ate ctg tea gat tec cga ttt ccg 
asn ile ser lys met his tyr leu ser arg cys gin ile leu ser asp ser arg phe pro 
thr tyr arg arg cys thr thr OPA val val ala arg ser cys gin ile pro asp phe arg 
his ile glu asp ala leu leu glu ser leu pro asp pro val arg phe pro ile ser ala 
1081/361 1111/371 
caa agg age ggt acg ccc atg acc gtg ace gtt tac act aa 
gin arg ser gly thr pro met thr val thr val tyr thr 
lys gly ala val arg pro OPA pro OPA pro phe thr leu 
lys glu arg tyr ala his asp arg asp arg leu his OCH 

SEQ ID No, 14Q{ continued) 
FIGURE 14Q (continued) 



1/1 

CAA GCC 
gin ala 
61/21 
GCG CAC 
ala his 
121/41 
CGA TGC 
arg cys 
181/61 
GGC GGI 
gly gl^ 
241/81 















31/11 


















CGA 


CTG 


TTT 


GCC 


GTT 


TTG 


GGG CTC 


CTA 


CCA 


GAA 


CAC 


CAC 


CTG 


GCG 


GCC 


arg 


leu 


phe 


ala 


val 


leu 


gly leu 


leu 


pro 


glu 


his 


his 


leu 


ala 


ala 










91/31 


















GTG 


CAC 


CAG 


TTG 


CGA 


TCG 


GTT CCT 


CCC 


GCG 


CGC 


GGG 


CGG 


CGA 


CGA 


CGT 


val 


his 


gin 


leu 


arg 


ser 


val pro 
151/51 


pro 


ala 


arg 


gly 


arg 


arg 


arg 


arg 


CCC 


GGC 


GGC 


GCA 


GCT 


GCG 


TAG CTC 


GAC 


CCG 


GTC 


GAC 


GAC 


GAC 


GGG 


GTC 


pro 


gly 


gly 


ala 


ala 


ala 


AMB leu 


asp 


pro 


val 


asp 


asp 


asp 


gly 


val 










211/71 


















GGC 


GAT 


GTC 


GAG 


GCG 


ATG 


GCA ATA 


CAG 


CGC 


CTT 


GGT 


GCG 


CGG 


CCA 


CAC 


gly 


asp 


val 


glu 


ala 


met 


ala ile 


gin 


arg 


leu 


gly 


ala 


arg 


pro 


his 










271/91 


















GAA 


GAC 


CAG 


TCC 


CGC 


GCC 


CAC CGG 


CAG 


CCG 


GAT 


CCG 


GAT 


ACG 


CGG 


TAC 


glu 


asp 


gin 


ser 


arg 


ala 


his arg 


gin 


pro 


asp 


pro 


asp 


thr 


arg 


tyr 



SEQ ID N0.15A 
FIC5URE 15A 



REPLACEMENT SHEET (RULE 26) 



50/185 



AAG CCC 


GGC 


CGC 


GAC 


TGT 


TTG 


lys pro 


gly 


arg 


asp 


cys 


leu 


62/21 












CGC ACC 


ATG 


GTG 


TGC 


ACC 


AGT 


arg thr 


met 


val 


cys 


thr 


ser 


122/41 












GAT GCC 


CGC 


GCC 


CCG 


GCG 


GCG 


asp ala 


arg 


ala 


pro 


ala 


ala 


182/61 












GCG GAC 


CAG 


TCG 


GCG 


ATG 


TCG 


ala asp 


gin 


ser 


ala 


met 


ser 


242/81 












TCT GAG 


GTG 


GCG 


AAG 


ACC 


AGT 


ser glu 


val 


ala 


lys 


thr 


ser 









32/11 




CCG 


TTT 


TGG 


GGC TCC 


TAC 


pro 


phe 


trp 


gly ser 


tyr 








92/ 31 




TGC 


GAT 


CGG 


TTC CTC 


CCG 


cys 


asp 


arg 


phe leu 


pro 








152/51 




CAG 


CTG 


CGT 


AGC TCG 


ACC 


gin 


leu 


arg 


ser ser 


thr 








212/71 




AGG 


CGA 


TGG 


CAA TAC 


AGC 


arg 


arg 


trp 


gin tyr 


ser 








272/91 




CCC 


GCG 


CCC 


ACC GGC 


AGC 


pro 


ala 


pro 


thr gly 


ser 



SEQ ID No.lSB 
FIGURE 15B 



CAG AAC ACC ACC TGG CGG CCG 
gin asn thr thr trp arg pro 

CGC GCG GGC GGC GAC GAC GTC 
arg ala gly gly asp asp val 

CGG TCG ACG ACG ACG GGG TCG 
arg ser thr thr thr gly ser 

GCC TTG GTG CGC GGC CAC ACG 
ala leu val arg gly his thr 

CGG ATC CGG ATA CGC GGT AC 
arg lie arg ile arg gly 



33/11 

AGC CCG GCC GCG ACT GTT TGC CGT TTT GGG GCT CCT ACC AGA ACA CCA CCT GGC GGC CGC 
ser pro ala ala thr val cys arg phe gly ala pro thr arg thr pro pro gly gly arg 
63/21 93/31 

GCA CCA TGG TGT GCA CCA GTT GCG ATC GGT TCC TCC CGC GCG CGG GCG GCG ACG ACG TCG 
ala pro trp cys ala pro val ala ile gly ser ser arg ala arg ala ala thr thr ser 
123/41 153/51 

ATG CCC GCG CCC CGG CGG CGC AGC TGC GTA GCT CGA CCC GGT CGA CGA CGA CGG GGT CGG 
met pro ala pro arg arg arg ser cys val ala arg pro gly arg arg arg arg gly arg 
183/61 213/71 

CGG ACC AGT CGG CGA TGT CGA GGC GAT GGC AAT ACA GCG CCT TGG TGC GCG GCC ACA CGT 
arg thr ser arg arg cys arg gly asp gly asn thr ala pro trp cys ala ala thr arg 
243/81 273/91 

CTG AGG TGG CGA AGA CCA GTC CCG CGC CCA CCG GCA GCC GGA TCC GGA TAC GCG GTA C 
leu arg trp arg arg pro val pro arg pro pro ala ala gly ser gly tyr ala val 



SEQ ID No ,150 
FIGURE 15C 



REPLACEMENT SHEET (RULE 26) 



51/185 



part of the nucleotide sequence of seqlSA 



1/1 31/11 



GGC GGC 


GGC 


GCG 


CCA 


TGG 


TGT 


GCA 


CCA 


GTT 


GCG ATC 


GGT 


TCT 


CCC 


GCG 


CGC 


GGG 


CGG 


CGA 


gly gly 


arg 


ala 


pro 


trp 


cys 


ala 


pro 


val 


ala ile 


gly 


ser 


pro 


ala 


arg 


gly 


arg 


arg 


61/21 


















91/31 


















CGA CGT 


CGA 


TGG 


CCG 


CGC 


CCC 


GGC 


GGC 


TGC 


AGC TGC 


GTA 


GCT 


CGA 


CCC 


GGT 


CGA 


CGA 


CGA 


arg arg 


arg 


trp 


pro 


arg 


pro 


gly 


gly 


cys 


ser cys 


val 


ala 


arg 


pro 


gly 




arg 


arg 


121/41 


















151/51 


















CGG GGT 


CGG 


CGG 


GCC 


AGT 


CGG 


CGA 


TGT 


CGA 


GGC GAT 


GGC 


AAT 


ACA 


GCG 


CCT 


TGG 


TGC 


GCG 


arg gly 


arg 


arg 


ala 


ser 


arg 


arg 


cys 


arg 


gly asp 


gly 


asn 


thr 


ala 


pro 


trp 


cys 


ala 


181/61 


















211/71 


















GCC ACA 


CGT 


CTG 


AGG 


TGG 


CGA 


AGA 


CCA 


GTC 


CCG CGC 


CCA 


CCG 


GCA 


GCC 


GGA 


TC 






ala thr 


arg 


leu 


arg 


trp 


arg 


arg 


pro 


val 


pro arg 


pro 


pro 


ala 


ala 


gly 









SEQ ID N0.15A' 
FIGURE 15A' 



1/1 

GCG GCC GCG CGC 
ala ala ala arg 
61/21 

GAC GTC GAT GGC 
asp val asp gly 
121/41 

GGG GTC GGC GGG 
gly val gly gly 
181/61 

CCA CAC GTC TGA 
pro his val OPA 



CAT GGT GTG CAC 
his gly val his 

CGC GCC CCG GCG 
arg ala pro ala 

CCA GTC GGC GAT 
pro val gly asp 

GGT GGC GAA GAC 
gly gly glu asp 







31/11 


CAG 


TTG 


CGA TCG 


gin 


leu 


arg ser 






91/31 


GCT 


GCA 


GCT GCG 


ala 


ala 


ala ala 






151/51 


GTC 


GAG 


GCG ATG 


val 


glu 


ala met 






211/71 


CAG 


TCC 


CGC GCC 


gin 


ser 


arg ala 



SEQ ID No,15B 
FIGURE 15B' 



GTT CTC CCG CGC 
val leu pro arg 

TAG CTC GAC CCG 
AMB leu asp pro 

GCA ATA CAG CGC 
ala ile gin arg 

CAC CGG CAG CCG 
his arg gin pro 



GCG GGC GGC GAC 
ala gly gly asp 

GTC GAC GAC GAC 
val asp asp asp 

CTT GGT GCG CGG 
leu gly ala arg 

GAT C 
asp 



1/1 

TGG CGG CCG CGC GCC 
trp arg pro arg ala 
61/21 

ACG ACG TCG ATG GCC 
thr thr ser met ala 
121/41 

ACG GGG TCG GCG GGC 
thr gly ser ala gly 
181/61 

GGC CAC ACG TCT GAG 
gly his thr ser glu 



ATG GTG TGC ACC AGT 
met val cys thr ser 

GCG CCC CGG CGG CTG 
ala pro arg arg leu 

CAG TCG GCG ATG TCG 
gin ser ala met ser 

GTG GCG AAG ACC AGT 
val ala lys thr ser 



31/11 

TGC GAT CGG TTC TCC 
cys asp arg phe ser 
91/31 

CAG CTG CGT AGC TCG 
gin leu arg ser ser 
151/51 

AGG CGA TGG CAA TAC 
arg arg trp gin tyr 
211/71 

CCC GCG CCC ACC GGC 
pro ala pro thr gly 



CGC GCG CGG GCG GCG 
arg ala arg ala ala 

ACC CGG TCG ACG ACG 
thr arg ser thr thr 

AGC GCC TTG GTG CGC 
ser ala leu val arg 

AGC CGG ATC 
ser arg ile 



SEQ ID No,15C' 

FIGURE 
REPLACEMENT SHEET (RULE 26) 
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ORF containing SeqlSA' according to Cole et al . (Nature 393:537-544) 
1/1 31/11 

taa ggt cog cca acg ctt tac get cga egg ccg cca cga gtt ggc egg cca ctt tea ggc 
OCH gly pro pro thr leu tyr ala arg arg pro pro arg val gly arg pro leu ser gly 
61/21 91/31 

cgt agt cgc cgc agg gca ggg ctt ccc gcg teg tct teg egg gtt tgt egg caa agg tgt 
arg ser arg arg arg ala gly leu pro ala ser ser ser arg val cys arg gin arg cys 
121/41 151/51 

agg ggt age gtt cgt ggg cgt cga cga cga tgt gca get egg gga tge egg egg cgc ggg 
arg gly ser val arg gly arg arg arg arg cys ala ala arg gly cys arg arg arg gly 
181/61 211/71 

egg tgg ggg tge gca cgc ccg gee gcg act gtt tge gcg ttt tgg ggc tct gcc aga aca 
arg trp gly cys ala arg pro ala ala thr val cys ala phe trp gly ser ala arg thr 
241/81 271/91 

cca cct ggc ggc cgc gcg cca tgg tgt gca cca gtt gcg ate ggt tct ccc gcg cgc ggg 
pro pro gly gly arg ala pro trp cys ala pro val ala ile gly ser pro ala arg gly 
301/101 331/111 

egg ega cga cgt cga tgg ccg cgc ccc ggc ggc tge age tge gta get cga ccc ggt cga 
arg arg arg arg arg trp pro arg pro gly gly cys ser cys val ala arg pro gly arg 
361/121 391/131 

ega cga egg ggt egg egg gee agt egg cga tgt ega ggc gat ggc aat aca gcg cct tgg 
arg arg arg gly arg arg ala ser arg arg cys arg gly asp gly asn thr ala pro trp 
421/141 451/151 

tge gcg gcc aca cgt ctg agg tgg ega aga cca gtc ccg cgc cca ccg gca gee gga tea 
cys ala ala thr arg leu arg trp arg arg pro val pro arg pro pro ala ala gly ser 
481/161 511/171 

ggt agg gca ggc gcg agt ctt eag egg ggt tgg egg cga cga gca get cca cag agt gtg 
gly arg ala gly ala ser leu gin arg gly trp arg arg arg ala ala pro gin ser val 
541/181 571/191 

agg gta egg gcg gcg tac ggc aac ggt gaa gca ggc act ccg acg aac cca tog tea cgt 
arg val arg ala ala tyr gly asn gly glu ala gly thr pro thr asn pro ser ser arg 
601/201 

cga agg ggc agg tga 
arg arg gly arg OPA 



SEQ ID N0.15F 
FIGURE 15F 



REPLACEMENT SHEET (RULE 26) 



# 
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R:Rv2530c predicted according to Cole et al . (Nature 393:537-544) and which may 
be in phase with SEQ15A 

1/1 31/11 



gtg a eg 


gca 


ctg 


etc 


gat 


gtc 


aat 


gtg 


ctg 


ate gcg 


ctg 


gge 


tgg 


ccg 


aat 


cac 


gtt 


cac 


val thr 


ala 


leu 


leu 


asp 


val 


asn 


val 


leu 


ile ala 


leu 


gly 


trp 


pro 


asn 


his 


val 


his 


61/21 


















91/31 


















cat gcg 


gcc 


gcg 


cag 


cga 


tgg 


ttc 


aeg 


cag 


ttc tec 


teg 


aat 


ggg 


tgg 


gee 


ace 


aeg 


ccg 


his ala 


ala 


ala 


gin 


arg 


trp 


phe 


thr 


gin 


phe ser 


ser 


asn 


gly 


trp 


ala 


thr 


thr 


pro 


121/41 


















151/51 


















ate acc 


gag 


gca 


ggg 


tat 


gtc 


cga 


att 


tea 


age aat 


cgc 


agt 


gtg 


atg 


cag 


gtg 


teg 


ace 


lie thr 


glu 


ala 


gly 


tyr 


val 


arg 


ile 


ser 


ser asn 


arg 


ser 


val 


met 


gin 


val 


ser 


thr 


181/61 


















211/71 


















acg ccg 


get 


ate 


gcg 


ate 


get 


cag 


ttg 


gcg 


gcg atg 


act 


tet 


ett 


gee 


ggg 


cac 


aeg 


ttt 


thr pro 


ala 


ile 


ala 


ile 


ala 


gin 


leu 


ala 


ala met 


thr 


ser 


leu 


ala 


gly 


his 


thr phe 


241/81 


















271/91 


















tgg cct 


gac 


gat 


gtg 


cca 


ctg 


ate 


gtt 


ggg 


age gcc 


ggc 


gat 


cgc 


gat 


gcg 


gtg 


tec 


aac 


trp pro 


asp 


asp 


val 


pro 


leu 


ile 


val 


gly 


ser ala 


gly 


asp 


arg 


asp 


ala 


val 


ser 


asn 


301/101 


















331/111 


















cac cgt 


egg 


gtc 


acc 


gac 


tgc 


cat 


etc 


ate 


gee ttg 


gee 


gcg 


cgc 


tac 


ggg 


ggc 


egg 


ttg 


1 his arg 


arg 


val 


thr 


asp 


cys 


his 


leu 


ile 


ala leu 


ala 


ala 


arg 


tyr 


gly 


gly 


arg 


leu 


2 361/121 


















391/131 


















U ^tc aca 


ttc 


gat 


gcc 


gca 


ctg 


gcc 


gat 


tea 


gca tec 


gca 


gge 


etc 


gtc 


gag 


gtg 


ttg 


tag 


■J val thr phe 


asp 


ala 


ala 


leu 


ala 


asp 


ser 


ala ser 


ala 


gly 


leu 


val 


glu 


val 


leu 


AMB 



SEQ ID No.lSR 



FIGURE 15R 



Seql5P: ORF according to Cole et al . (Nature 393:537-544) containing Rv2530e 
1/1 31/11 

tga tgt tec gee gga tgc gee gac ggt gac ttc cga gga tgt cgt ccg cgc get cga gga 
OPA cys ser ala gly cys ala asp gly asp phe arg gly cys arg pro arg ala arg gly 
61/21 91/31 

cga cgt gtg acg gca ctg etc gat gtc aat gtg ctg ate gcg ctg ggc tgg ccg aat cac 
arg arg val thr ala leu leu asp val asn val leu ile ala leu gly trp pro asn his 
121/41 , 151/51 

gtt cac cat gcg gee gcg cag ega tgg ttc acg cag ttc tec teg aat ggg tgg gee acc 
val his his ala ala ala gin arg trp phe thr gin phe ser ser asn gly trp ala thr 
181/61 211/71 

acg ccg ate ace gag gca ggg tat gtc ega att tea age aat ege agt gtg atg cag gtg 
thr pro ile thr glu ala gly tyr val arg ile ser ser asn arg ser val met gin val 
241/81 271/91 

teg ace acg ccg get ate gcg ate get cag ttg gcg gcg atg act tet ett gee ggg eac 
ser thr thr pro ala ile ala ile ala gin leu ala ala met thr ser leu ala gly his 
301/101 331/111 

aeg ttt tgg cct gac gat gtg cca ctg ate gtt ggg age gcc ggc gat cgc gat gcg gtg 
thr phe trp pro asp asp val pro leu ile val gly ser ala gly asp arg asp ala val 
361/121 391/131 

tec aac eac cgt egg gtc ace gac tgc cat etc ate gee ttg gcc gcg ege tae ggg ggc 
ser asn his arg arg val thr asp cys his leu ile ala leu ala ala arg tyr gly gly 
421/141 451/151 

egg ttg gtc aca ttc gat gee gca ctg gcc gat tea gca tec gca gge etc gtc gag gtg 
arg leu val thr phe asp ala ala leu ala asp ser ala ser ala gly leu val glu val 
481/161 
ttg tag 
leu AMB 

SEQ ID No,15P 



FIGURE 15P 
REPLACEMENT SHEET (RULE 26) 



54/185 



Fragment containing SeqlSP' and Seq 15F' 
1/1 31/11 

tga tgt tec gcc gga tgc gcc gac ggt gac ttc cga gga tgt cgt ccg ego get cga gga 
OPA cys ser ala gly cys ala asp gly asp phe arg gly cys arg pro arg ala arg gly 
asp val pro pro asp ala pro thr val thr ser glu asp val val arg ala leu glu asp 
Met phe arg arg met arg arg arg OPA leu pro arg met ser ser ala arg ser arg thr 
61/21 91/31 

cga cgt gtg acg gca ctg etc gat gtc aat gtg ctg ate gcg ctg ggc tgg ccg aat cac 
arg arg val thr ala leu leu asp val asn val leu ile ala leu gly trp pro asn his 
asp val OPA arg his cys ser met ser met cys OPA ser arg trp ala gly arg ile thr 
thr cys asp gly thr ala arg cys gin cys ala asp arg ala gly leu ala glu ser arg 
121/41 151/51 

gtt cac cat gcg gcc gcg cag cga tgg ttc acg cag ttc tec teg aat ggg tgg gee ace 
val his his ala ala ala gin arg trp phe thr gin phe ser ser asn gly trp ala thr 
phe thr met arg pro arg ser asp gly ser arg ser ser pro arg met gly gly pro pro 
ser pro cys gly arg ala ala met val his ala val leu leu glu trp val gly his his 
181/61 211/71 

acg ccg ate acc gag gca ggg tat gtc cga att tea age aat cgc agt gtg atg cag gtg 
thr pro ile thr glu ala gly tyr val arg ile ser ser asn arg ser val met gin val 
arg arg ser pro arg gin gly met ser glu phe gin ala ile ala val OPA cys arg cys 
ala asp his arg gly arg val cys pro asn phe lys gin ser gin cys asp ala gly val 
241/81 271/91 

teg ace acg ccg get ate gcg ate get cag ttg gcg gcg atg act tet ett gcc ggg cac 
ser thr thr pro ala ile ala ile ala gin leu ala ala met thr ser leu ala gly his 
arg pro arg arg leu ser arg ser leu ser trp arg arg OPA leu leu leu pro gly thr 
asp his ala gly tyr arg asp arg ser val gly gly asp asp phe ser cys arg ala his 
301/101 331/111 

acg ttt tgg cct gac gat gtg cca ctg ate gtt ggg age gcc ggc gat cgc gat gcg gtg 
thr phe trp pro asp asp val pro leu ile val gly ser ala gly asp arg asp ala val 
arg phe gly leu thr met cys his OPA ser leu gly ala pro ala ile ala met arg cys 
val leu ala OPA arg cys ala thr asp arg trp glu arg arg arg ser arg cys gly val 
361/121 391/131 

tee aac cac cgt egg gtc acc gac tgc cat etc ate gcc ttg gcc gcg cgc tac ggg ggc 
ser asn his arg arg val thr asp cys his leu ile ala leu ala ala arg tyr gly gly 
pro thr thr val gly ser pro thr ala ile ser ser pro trp pro arg ala thr gly ala 
gin pro pro ser gly his arg leu pro ser his arg leu gly arg ala leu arg gly pro 
421/141 451/151 

egg ttg gtc aca ttc gat gee gca ctg gcc gat tea gca tec gca ggc etc gtc gag gtg 
arg leu val thr phe asp ala ala leu ala asp ser ala ser ala gly leu val glu val 
gly trp ser his ser met pro his trp pro ile gin his pro gin ala ser ser arg cys 
val gly his ile arg cys arg thr gly arg phe ser ile arg arg pro arg arg gly val 
481/161 511/171 

ttg tag tea ccg ggg atg ggc ggc teg cca ggc ctg cag gat ctg egg gcg cag gcg ecc 
leu AMB ser pro gly met gly gly ser pro gly leu gin asp leu arg ala gin ala pro 
cys ser his arg gly trp ala ala arg gin ala cys arg ile cys gly arg arg arg pro 
val val thr gly asp gly arg leu ala arg pro ala gly ser ala gly ala gly ala pro 
541/181 571/191 

ccg gtc gga cac egg cag gcc gac get ttt ggc cca cgc gcg cag etc ggc get get ggg 
pro val gly his arg gin ala asp ala phe gly pro arg ala gin leu gly ala ala gly 
arg ser asp thr gly arg pro thr leu leu ala his ala arg ser ser ala leu leu gly 
gly arg thr pro ala gly arg arg phe trp pro thr arg ala ala arg arg cys trp ala 
601/201 631/211 

etc ggg etc ggc ggc age egg etc gaa aae cgt ggt ggc gtc ggc ate gtc gac gaa cca 
leu gly leu gly gly ser arg leu glu asn arg gly gly val gly ile val asp glu pro 
ser gly ser ala ala ala gly ser lys thr val val ala ser ala ser ser thr asn gin 
arg ala arg arg gin pro ala arg lys pro trp trp arg arg his arg arg arg thr arg 



SEQ ID N0.15Q 
FIGURE 15Q 



REPLACEMENT SHEET (RULE 26) 
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661/221 691/231 

ggt gag ggc ggc ggc tag ata gcg gta ggt gta ttc ctg ggc gag ctt gcg ggt ttg gca 
gly glu gly gly gly AMB ile ala val gly val phe leu gly glu leu ala gly leu ala 
val arg ala ala ala arg AMB arg AMB val tyr ser trp ala ser leu arg val trp gin 
OPA gly arg arg leu asp ser gly arg cys ile pro gly arg ala cys gly phe gly arg 
721/241 751/251 

gaa cac gat egg cac gtt ggg aaa gcc gat ctg caa ttc ggc cag ccc ate ggc gat cgc 
glu his asp arg his val gly lys ala asp leu gin phe gly gin pro ile gly asp arg 
asn thr ile gly thr leu gly lys pro ile cys asn ser ala ser pro ser ala ile ala 
thr arg ser ala arg trp glu ser arg ser ala ile arg pro ala his arg arg ser pro 
781/261 811/271 

cgt egg gcg ggc gaa gga gtg cgc gaa gat etc cga gta gcg gtc etc gac cac cac ggc 
arg arg ala gly glu gly val arg glu asp leu arg val ala val leu asp his his gly 
val gly arg ala lys glu cys ala lys ile ser glu AMB arg ser ser thr thr thr ala 
ser gly gly arg arg ser ala arg arg ser pro ser ser gly pro arg pro pro arg arg 
841/281 871/291 

ggc ccg tgg cag cgc ggc cag ttc ggt cag ttg gta ttt cag gtt gcc gtt cag cac gcc 
gly pro trp gin arg gly gin phe gly gin leu val phe gin val ala val gin his ala 
ala arg gly ser ala ala ser ser val ser trp tyr phe arg leu pro phe ser thr pro 
pro val ala ala arg pro val arg ser val gly ile ser gly cys arg ser ala arg gin 
901/301 931/311 

aga agt aag gtc cgc caa cgc ttt acg etc gac ggc cgc cac gag ttg gcc ggc cac ttt 
arg ser lys val arg gin ar^ phe thr leu asp gly arg his glu leu ala gly his phe . 
glu val arg ser ala asn ala leu arg ser thr ala ala thr ser trp pro ala thr phe 
lys OCH gly pro pro thr leu tyr ala arg arg pro pro arg val gly arg pro leu ser 
961/321 991/331 

cag gcc gta gtc gcc gca ggg cag ggc ttc ccg cgt cgt ctt cgc ggg ttt gtc ggc aaa 
gin ala val val ala ala gly gin gly phe pro arg arg leu arg gly phe val gly lys 
arg pro AMB ser pro gin gly arg ala ser arg val val phe ala gly leu ser ala lys 
gly arg ser arg arg arg ala gly leu pro ala ser ser ser arg val cys arg gin arg 
1021/341 1051/351 

ggt gta ggg gta gcg ttc gtg ggc gtc gac gac gat gtg cag etc ggg gat gee ggc ggc 
gly val gly val ala phe val gly val asp asp asp val gin leu gly asp ala gly gly 
val AMB gly AMB arg ser trp ala ser thr thr met cys ser ser gly met pro ala ala 
cys arg gly ser val arg gly arg arg arg arg cys ala ala arg gly cys arg arg arg 
1081/361 1111/371 

gcg ggc ggt ggg ggt gcg cac gee egg ccg cga ctg ttt gcg cgt ttt ggg get ctg cea 
ala gly gly gly gly ala his ala arg pro arg leu phe ala arg phe gly ala leu pro 
arg ala val gly val arg thr pro gly arg asp cys leu arg val leu gly leu cys gin 
gly arg trp gly cys ala arg pro ala ala thr val cys ala phe trp gly ser ala arg 
1141/381 1171/391 

gaa cac cac ctg gcg gcc gcg cgc cat ggt gtg cac cag ttg cga teg gtt etc ccg cgc 
glu his his leu ala ala ala arg his gly val his gin leu arg ser val leu pro arg 
asn thr thr trp arg pro arg ala met val cys thr ser cys asp arg phe ser arg ala 
thr pro pro gly gly arg ala pro trp cys ala pro val ala ile gly ser pro ala arg 
1201/401 1231/411 

gcg ggc ggc gac gac gtc gat ggc cgc gcc ccg gcg get gca get gcg tag etc gac ccg 
ala gly gly asp asp val asp gly arg ala pro ala ala ala ala ala AMB leu asp pro 
arg ala ala thr thr ser met ala ala pro arg arg leu gin leu arg ser ser thr arg 
gly arg arg arg arg arg trp pro arg pro gly gly cys ser cys val ala arg pro gly 
1261/421 1291/431 

gtc gac gac gac ggg gtc ggc ggg eca gtc ggc gat gtc gag gcg atg gca ata cag cgc 
val asp asp asp gly val gly gly pro val gly asp val glu ala met ala ile gin arg 
ser thr thr thr gly ser ala gly gin ser ala met ser arg arg trp gin tyr ser ala 
arg arg arg arg gly arg arg ala ser arg arg cys arg gly asp gly asn thr ala pro 
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1321/441 1351/451 

ctt ggt gcg egg cca cac gtc tga ggt ggc gaa gac cag tec cgc gcc cac egg cag ccg 
leu gly ala arg pro his val OPA gly gly glu asp gin ser arg ala his arg gin pro 
leu val arg gly his thr ser glu val ala lys thr ser pro ala pro thr gly ser arg 
trp cys ala ala thr arg leu arg trp arg arg pro val pro arg pro pro ala ala gly 
1381/461 1411/471 

gat cag gta ggg cag gcg cga gtc ttc age ggg gtt ggc ggc gac gag cag etc cac aga 
asp gin val gly gin ala arg val phe ser gly val gly gly asp glu gin leu his arg 
ile arg AMB gly arg arg glu ser ser ala gly leu ala ala thr ser ser ser thr glu 
ser gly arg ala gly ala ser leu gin arg gly trp arg arg arg ala ala pro gin ser 
1441/481 1471/491 

gtg tga ggg tac ggg egg cgt acg gca acg gtg aag cag gca etc cga cga acc cat cgt 
val OPA gly tyr gly arg arg thr ala thr val lys gin ala leu arg arg thr his arg 
cys glu gly thr gly gly val arg gin arg OPA ser arg his ser asp glu pro ile val 
val arg val arg ala ala tyr gly asn gly glu ala gly thr pro thr asn pro ser ser 
1501/501 

cac gtc gaa ggg gca ggt ga 
his val glu gly ala gly 
thr ser lys gly gin val 
arg arg arg gly arg OPA 



SEQ ID No,15Q (continued 2) 
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TGC GCA 


TGC 


CGA 


CCA 


GTG 


TGG 


TTG 


GCC 


GGA 


GTT CGT 


TTG 


TTC 


GCG 


ATT 


GCC 


TCA 


ACG 


ATT 


cys ala 


cys 


arg 


pro 


val 


trp 


leu 


ala 


gly val arg 


leu 


phe 


ala 


ile 


ala 


ser 


thr 


ile 


61/21 
















91/31 


















CGA TAT 


AAC 


CAC 


TCT 


AGT 


CAC 


ATC 


AAC 


CAC 


ACT CGT 


ACC 


ATC 


GAG 


CGT 


GTG 


GGT 


TCA 


TGC 


arg tyr 


asn 


his 


ser 


ser 


his 


ile 


asn 


his 


thr arg 


thr 


ile 


glu 


arg 


val 


gly 


ser 


cys 


121/41 


















151/51 


















CAT GCA 


TTC 


GCG 


ACC 


GCG 


GGA 


GCC 


GGC 


GAA 


CCC GGC 


GCC 


ACA 


CAT 


AAT 


CCA 


GAT 


TGA 


GGA 


his ala 


phe 


ala 


thr 


ala 


gly 


ala 


gly 


glu 


pro gly 


ala 


thr 


his 


asn 


pro 


asp 


OPA 


gly 


181/61 
















211/71 


















GAC TTC 


CGT 


GCC 


GAA 


CCG 


ACG 


CCG 


ACG 


CAA 


GCT TTC 


GAC 


AGC 


CAT 


GAG 


CGC 


GGT 


CGC 


CGC 


asp phe 


arg 


ala 


glu 


pro 


thr 


pro 


thr 


gin 


ala phe 


asp 


ser 


his 


glu 


arg 


gly 


arg 


arg 


241/81 
















271/91 


















CCT GGC 


AGT 


TGC 


AAG 


TCC 


TTG 


TGC 


ATA 


TTT 


TCT TGT 


CTA 


CGA 


ATC 


AAC 


CGA 


AAC 


GAC 


CGA 


pro gly 


ser 


cys 


lys 


ser 


leu 


cys 


ile 


phe 


ser cys 


leu 


arg 


ile 


asn 


arg 


asn 


asp 


arg 


301/101 


















331/111 


















GCG GCC 


CGA 


GCA 


CCA 


TGA 


ATT 


CAA 


GCA 


GGC 


GGC GGT 


GTT 


GAC 


CGA 


CCT 


GCC 


CGG 


CGA 


GCT 


ala ala 


arg 


ala 


pro 


OPA 


ile 


gin 


ala 


gly 


gly gly 


val 


asp 


arg 


pro 


ala 


arg 


arg 


ala 


361/121 
















391/131 


















GAT GTC 


CGC 


GCT 


ATC 


GCA 


GGG 


GTT 


GTC 


CCA 


GTT CGG 


GAT 


C 














asp val 


arg 


ala 


ile 


ala 


gly 


val 


val 


pro 


val arg 


asp 
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32/11 



GCG CAT 


GCC 


GAC 


CAG 


TGT 


GGT 


TGG 


CCG 


GAG 


TTC GTT 


TGT 


TCG 


CGA 


TTG 


CCT 


CAA 


CGA 


TTC 


ala his 


ala 


asp 


gin 


cys 


gly 


trp 


pro 


glu 


phe val 


cys 


ser 


arg 


leu 


pro 


gin 


arg 


phe 


62/21 


















92/31 


















GAT ATA 


ACC 


ACT 


CTA 


GTC 


ACA 


TCA 


ACC 


ACA 


CTC GTA 


CCA 


TCG 


AGC 


GTG 


TGG 


GTT 


CAT 


GCC 


asp ile 


thr 


thr 


leu 


val 


thr 


ser 


thr 


thr 


leu val 


pro 


ser 


ser 


val 


trp 


val 


his 


ala 


122/41 


















152/51 


















ATG CAT 


TCG 


CGA 


CCG 


CGG 


GAG 


CCG 


GCG 


AAC 


CCG GCG 


CCA 


CAC 


ATA 


ATC 


CAG 


ATT 


GAG 


GAG 


met his 


ser 


arg 


pro 


arg 


glu pro 


ala 


asn 


pro ala 


pro 


his 


ile 


ile 


gin 


ile 


glu 


glu 


182/61 


















212/71 


















ACT TCC 


GTG 


CCG 


AAC 


CGA 


CGC 


CGA 


CGC 


AAG 


CTT TCG 


ACA 


GCC 


ATG 


AGC 


GCG 


GTC 


GCC 


GCC 


thr ser 


val 


pro 


asn 


arg 


arg 


arg 


arg 


lys 


leu ser 


thr 


ala 


met 


ser 


ala 


val 


ala 


ala 


242/81 


















272/91 


















CTG GCA 


GTT 


GGA 


AGT 


CCT 


TGT 


GCA 


TAT 


TTT 


CTT GTC 


TAC 


GAA 


TCA 


ACC 


GAA 


ACG 


ACC 


GAG 


leu ala 


val 


ala 


ser 


pro 


cys 


ala 


tyr 


phe 


leu val 


tyr 


glu 


ser 


thr 


glu 


thr 


thr 


glu 


3 302/101 


















332/111 


















CGG CCC 


GAG 


CAC 


CAT 


GAA 


TTC 


AAG 


CAG 


GCG 


GCG GTG 


TTG 


ACC 


GAC 


CTG 


CCC 


GGC 


GAG 


CTG 


f: arg pro 


glu 


his 


his 


glu 


phe 


lys 


gin 


ala 


ala val 


leu 


thr 


asp 


leu 


pro 


gly 


glu 


leu 


'f: 362/121 


















392/131 


















y ATG TCC 


GCG 


CTA 


TCG 


CAG 


GGG 


TTG 


TCC 


CAG 


TCC GGG 


ATC 
















met ser 


ala 


leu 


ser 


gin 


gly 


leu 


ser 


gin 


phe gly 


ile 

















SEQ ID N0.16B 
FIGURE 16B 



33/11 



CGC ATG 


CCG 


ACC 


AGT 


GTG 


GTT 


GGC 


CGG 


AGT 


TCG TTT 


GTT 


CGC 


GAT 


TGC 


CTC 


AAC 


GAT 


TCG 


arg met 


pro 


thr 


ser 


val 


val 


gly 


arg 


ser 


ser phe 


val 


arg 


asp 


cys 


leu 


asn 


asp 


ser 


63/21 


















93/31 


















ATA TAA 


CCA 


CTC 


TAG 


TCA 


CAT 


CAA 


CCA 


CAC 


TCG TAC 


CAT 


CGA 


GCG 


TGT 


GGG 


TTC 


ATG 


CCA 


ile OCH 


pro 


leu 


AMB 


ser 


his 


gin 


pro 


his 


ser tyr 


his 


arg 


ala 


cys 


gly 


phe 


met 


pro 


123/41 


















153/51 


















TGC ATT 


CGC 


GAC 


CGC 


GGG 


AGC 


CGG 


CGA 


ACC 


CGG CGC 


CAC 


ACA 


TAA 


TCC 


AGA 


TTG 


AGG 


AGA 


cys ile 


arg 


asp 


arg 


gly 


ser 


arg 


arg 


thr 


arg arg 


his 


thr 


OCH 


ser 


arg 


leu 


arg 


arg 


183/61 


















213/71 


















CTT CCG 


TGC 


CGA 


ACC 


GAC 


GCC 


GAC 


GCA 


AGC 


TTT CGA 


CAG 


CCA 


TGA 


GCG 


CGG 


TCG 


CCG 


CCC 


leu pro 


cys 


arg 


thr 


asp 


ala 


asp 


ala 


ser 


phe arg 


gin 


pro 


OPA 


ala 


arg 


ser 


pro 


pro 


243/81 


















273/91 


















TGG CAG 


TTG 


CAA 


GTC 


CTT 


GTG 


CAT 


ATT 


TTC 


TTG TCT 


ACG 


AAT 


CAA 


CCG 


AAA 


CGA 


CCG 


AGC 


trp gin 


leu 


gin 


val 


leu 


val 


his 


ile 


phe 


leu ser 


thr 


asn 


gin 


pro 


lys 


arg 


pro 


ser 


303/101 


















333/111 


















GGC CCG 


AGC 


ACC 


ATG 


AAT 


TCA 


AGC 


AGG 


CGG 


CGG TGT 


TGA 


CCG 


ACC 


TGC 


CCG 


GCG 


AGC 


TGA 


gly pro 


ser 


thr 


met 


asn 


ser 


ser 


arg 


arg 


arg cys 


OPA 


pro 


thr 


cys 


pro 


ala 


ser 


OPA 


363/121 


















393/131 


















TGT CCG 


CGC 


TAT 


CGC 


AGG 


GGT 


TGT 


CCC 


AGT 


TCG GGA 


TC 
















cys pro 


arg 


tyr 


arg 


arg 


gly 


cys 


pro 


ser 


ser gly 



















SEQ ID N0.16C 
FIGURE 16C 



REPLACEMENT SHEET (RULE 2€) 



58/185 





















31/11 


















GCG GGC 


CAC 


CGA 


TCA 


GTC 


GAT 


CGG 


GTG 


GTT 


TCC GCT 


CCA 


TCA 


GCC 


CGG 


AAT 


TGA 


GGT 


GCC 


ala gly 


his 


arg 


ser 


val 


asp 


arg 


val 


val 


ser ala 


pro 


ser 


ala 


arg 


asn 


OPA 


gly 


ala 


61/21 


















91/31 


















GCA GTG 


ACG 


ACA 


CCA 


GCG 


CAG 


GAC 


GCG 


CCG 


TTG GTG 


TTT 


CCC 


TCT 


GTT 


GCT 


TTC 


CCG 


TCC 


ala val 


thr 


thr 


pro 


ala 


gin 


asp 


ala 


pro 


leu val 


phe 


pro 


ser 


val 


ala 


phe 


pro 


ser 


121/41 


















151/51 


















GGC TCG 


CCT 


TTT 


TTT 


CAT 


CAA 


CGT 


TGG 


ACT 


GCC GCA 


GTG 


GCG 


ATG 


TTG 


GTC 


GCC 


GGC 


GTG 


gly ser 


pro 


phe 


phe 


his 


gin 


arg 


trp 


thr 


ala ala 


val 


ala 


met 


leu 


val 


ala 


gly 


val 


181/61 
















211/71 


















TTC GGT 


CAC 


CTG 


ACG 


GTC 


GGG 


ATG 


TTC 


CTT 


GGG TCT 


CGG 


GTT 


GCT 


GCT 


GGG 


TTT 


GCT 


CAA 


phe gly his 


leu 


thr 


val 


gly met phe 


leu 


gly ser 


arg 


val 


ala 


ala 


gly phe 


ala 


gin 


241/81 


















271/91 


















TGC CCT 


GCT 


GGT 


GCG 


GCG 


TTC 


GGC 


CGA 


GTC 


GAT CAC 


CGC 


CAA 


AGA 


GCA 


CCC 


GTT 


AAA 


ACG 


cys pro 


ala 


gly 


ala 


ala 


phe 


gly 


arg 


val 


asp his 


arg 


gin 


arg 


ala 


pro 


val 


lys 


thr 


301/101 


















331/111 


















GTC GAT 


GGC 


CCT 


CAA 


CTC 


GGC 


ATC 


GCG 


ACT 


GGC GAT 


TAT 


CAC 


CAT 


GCC 


TCG 


GGC 


TGA 


TC 


val asp 


gly 


pro 


gin 


leu 


gly 


ile 


ala 


thr 


gly asp 


tyr 


his 


his 


ala 


ser 


gly 


OPA 





SEQ ID N0.17A 
FIGURE 17A 





















32/11 


















CGG GCC 


ACC 


GAT 


CAG 


TCG 


ATC 


GGG 


TGG 


TTT 


CCG CTC 


CAT 


CAG 


CCC 


GGA 


ATT 


GAG 


GTG 


CCG 


arg ala 


thr 


asp 


gin 


ser 


ile 


gly 


trp 


phe 


pro leu 


his 


gin 


pro 


gly 


ile 


glu 


val 


pro 


62/21 


















92/31 


















CAG TGA 


CGA 


CAC 


CAG 


CGC 


AGG 


ACG 


CGC 


CGT 


TGG TGT 


TTC 


CCT 


CTG 


TTG 


CTT 


TCC 


CGT 


CCG 


gin OPA 


arg 


his 


gin 


arg 


arg 


thr 


arg 


arg 


trp cys 


phe 


pro 


leu 


leu 


leu 


ser 


arg 


pro 


122/41 
















152/51 


















GCT CGC 


CTT 


TTT 


TTC 


ATC 


AAC 


GTT 


GGA 


CTG 


CCG CAG 


TGG 


CGA 


TGT 


TGG 


TCG 


CCG 


GCG 


TGT 


ala arg 


leu 


phe 


phe 


ile 


asn 


val 


gly 


leu 


pro gin 


trp 


arg 


cys 


trp 


ser 


pro 


ala 


cys 


182/61 
















212/71 


















TCG GTC 


ACC 


TGA 


CGG 


TCG 


GGA 


TGT 


TCC 


TTG 


GGT CTC 


GGG 


TTG 


CTG 


CTG 


GGT 


TTG 


CTC 


AAT 


ser val 


thr 


OPA 


arg 


ser 


gly 


cys 


ser 


leu 


gly leu 


gly 


leu 


leu 


leu 


gly 


leu 


leu 


asn 


242/81 


















272/91 


















GCC CTG 


CTG 


GTG 


CGG 


CGT 


TCG 


GCC 


GAG 


TCG 


ATC ACC 


GCC 


AAA 


GAG 


CAC 


CCG 


TTA 


AAA 


CGG 


ala leu 


leu 


val 


arg 


arg 


ser 


ala 


glu 


ser 


ile thr 


ala 


lys 


glu 


his 


pro 


leu 


lys 


arg 


302/101 


















332/111 


















TCG ATG 


GCC 


CTC 


AAC 


TCG 


GCA 


TCG 


CGA 


CTG 


GCG ATT 


ATC 


ACC 


ATG 


CCT 


CGG 


GCT 


GAT 


C 


ser met 


ala 


leu 


asn 


ser 


ala 


ser 


arg 


leu 


ala ile 


ile 


thr 


met 


pro 


arg 


ala 


asp 





SEQ ID N0.17B 
FIGURE 17B 



REPLACEMENT SHEET (RULE 26) 



# 



59/185 



GGG CCA CCG ATC AGT CGA TCG 
gly pro pro ile ser arg ser 
63/21 

AGT GAC GAC ACC AGC GCA GGA 
ser asp asp thr ser ala gly 
123/41 

CTC GCC TTT TTT TCA TCA ACG 
leu ala phe phe ser ser thr 
183/61 

CGG TCA CCT GAC GGT CGG GAT 
arg ser pro asp gly arg asp 
243/81 

CCC TGC TGG TGC GGC GTT CGG 
pro cys trp cys gly val arg 
303/101 

CGA TGG CCC TCA ACT CGG CAT 
arg trp pro ser thr arg his 



33/11 

GGT GGT TTC CGC TCC ATC 
gly gly phe arg ser ile 
93/31 

CGC GCC GTT GGT GTT TCC 
arg ala val gly val ser 

153/51 
TTG GAC TGC CGC AGT GGC 
leu asp cys arg ser gly 

213/71 
GTT CCT TGG GTC TCG GGT 
val pro trp val ser gly 

273/91 
CCG AGT CGA TCA CCG CCA 
pro ser arg ser pro pro 

333/111 
CGC GAC TGG CGA TTA TCA 
arg asp trp arg leu ser 

SEQ ID N0.17C 
FIGURE lie 



AGC CCG GAA TTG AGG TGC CGC 
ser pro glu leu arg cys arg 

CTC TGT TGC TTT CCC GTC CGG 
leu cys cys phe pro val arg 

GAT GTT GGT CGC CGG CGT GTT 
asp val gly arg arg arg val 

TGC TGC TGG GTT TGC TCA ATG 
cys cys trp val cys ser met 

AAG AGC ACC CGT TAA AAC GGT 
lys ser thr arg OCH asn gly 

CCA TGC CTC GGG CTG ATC 
pro cys leu gly leu ile 



part of the nucleotide sequence of seql7A 
3^/1 31/11 

age tag aac ccc gaa gga gac etc gcg ggt tgc egg cec ccg gcc cat egg atg cgt ate 
gly AMB asn pro glu gly asp leu ala gly cys arg pro pro ala his arg met arg lie 
61/21 ^1/^1 

egg teg cgc cga ttc acg acc gac ata ggg age tac ccc ttg ggt gat tec ggt gcg acg 
arg ser arg arg phe thr thr asp ile gly ser tyr pro leu gly asp ser gly ala thr 
121/41 151/51 

act gcg ata cgc teg gcg ggc cae cga tea gtc gat egg gtg gtt tec get cca tea gcc 
thr ala ile arg ser ala gly his arg ser val asp arg val val ser ala pro ser ala 
181/61 211/71 

egg aat tga ggt gcc gca gtg acg aca cca gcg cag gac gcg ccg ttg gtg ttt ccc tct 
arg asn CPA gly ala ala val thr thr pro ala gin asp ala pro leu val phe pro ser 
241/81 271/91 

qtt act ttc cgt ccg gtt cgc ctt ttt ttc ate aac gtt gga ctg gcc gca gtg gcg atg 
val ala phe arg pro val arg leu phe phe ile asn val gly leu ala ala val ala met 

301/101 331/111 

ttg gtc gcc ggc gtg ttc ggt cac ctg acg gtc ggg atg ttc ttg ggt etc ggg ttg ctg 
leu val ala gly val phe gly his leu thr val gly met phe leu gly leu gly leu leu 
361/121 391/131 

ctg ggt ttg etc aat gcc ctg ctg gtg egg cgt teg gee gag teg ate ace gcc aaa gag 
LI gly leu leu asn ala leu leu val arg arg ser ala glu ser xle thr ala lys glu 
421/141 451/151 

cac ccg tta aaa egg teg atg gcc etc aac teg gca teg cga ctg gcg att ate acc ate 
his pro leu lys arg ser met ala leu asn ser ala ser arg leu ala lie ile thr xie 
481/161 

etc ggg ctg ate 
leu gly leu ile 

SEQ ID N0.17A' 
FIGURE 17A' 



RF.Pt .ArF.MFNT SHF-ET fRULE 26) 



60/185 



1/1 

get aga acc ccg 
ala arg thr pro 
61/21 

ggt cgc gcc gat 
gly arg ala asp 
121/41 

ctg cga tac get 
leu arg tyr ala 
181/61 

gga att gag gtg 
gly ile glu val 
241/81 

ttg ctt tec gtc 
leu leu ser val 
301/101 

tgg teg ccg gcg 
trp ser pro ala 
361/121 

tgg gtt tgc tea 
trp val cys ser 
421/141 

acc cgt taa aac 
thr arg OCH asn 
481/161 

teg ggc tga tc 
ser gly OPA 



aag gag acc teg 
lys glu thr ser 

tea cga ccg aca 
ser arg pro thr 

egg egg gcc acc 
arg arg ala thr 

ccg cag tga cga 
pro gin OPA arg 

egg ttc gcc ttt 
arg phe ala phe 

tgt teg gtc acc 
cys ser val thr 

atg ccc tgc tgg 
met pro cys trp 

ggt cga tgg ccc 
gly arg trp pro 



31/11 
egg gtt gee ggc 
arg val ala gly 

91/31 
tag gga get acc 
AMB gly ala thr 

151/51 
gat cag teg ate 
asp gin ser ile 

211/71 
cac cag cgc agg 
his gin arg arg 

271/91 
ttt tea tea acg 
phe ser ser thr 

331/111 
tga egg teg gga 
OPA arg ser gly 

391/131 
tgc ggc gtt egg 
cys gly val arg 

451/151 
tea act egg cat 
ser thr arg his 



ccc egg ccc ate 
pro arg pro ile 

cct tgg gtg att 
pro trp val ile 

ggg tgg ttt ccg 
gly trp phe pro 

acg cgc cgt tgg 
thr arg arg trp 

ttg gac tgg ccg 
leu asp trp pro 

tgt tct tgg gtc 
cys ser trp val 

ccg agt cga tea 
pro ser arg ser 

cgc gac tgg cga 
arg asp trp arg 



gga tgc gta tec 
gly cys val ser 

ccg gtg cga cga 
pro val arg arg 

etc cat cag ccc 
leu his gin pro 

tgt ttc cct ctg 
cys phe pro leu 

cag tgg cga tgt 
gin trp arg cys 

teg ggt tgc tgc 
ser gly cys cys 

ccg cca aag age 
pro pro lys ser 

tta tea cca tec 
leu ser pro ser 



SEQ ID N0.17B' 



FIGURE 17B' 



1/1 

eta gaa ccc cga agg aga cct cgc ggg ttg 
leu glu pro arg arg arg pro arg gly leu 

61/21 

gtc gcg ccg att cac gac cga cat agg gag 
val ala pro ile his asp arg his arg glu 
121/41 

tgc gat acg etc ggc ggg cca ccg ate agt 
cys asp thr leu gly gly pro pro ile ser 
181/61 

gaa ttg agg tgc cgc agt gac gac acc age 
glu leu arg cys arg ser asp asp thr ser 
241/81 

tgc ttt ccg tec ggt teg cct ttt ttt cat 
cys phe pro ser gly ser pro phe phe his 
301/101 

ggt cgc egg cgt gtt egg tea cct gac ggt 
gly arg arg arg val arg ser pro asp gly 
361/121 

ggg ttt get caa tgc cct get ggt gcg gcg 
gly phe ala gin cys pro ala gly ala ala 
421/141 

ccc gtt aaa acg gtc gat ggc cct caa etc 

pro val lys thr val asp gly pro gin leu 

481/161 

egg get gat c 

arg ala asp 



31/11 

ccg gcc ccc ggc cca teg gat gcg tat ccg 
pro ala pro gly pro ser asp ala tyr pro 
91/31 

eta ccc ctt ggg tga ttc egg tgc gac gac 
leu pro leu gly OPA phe arg cys asp asp 
151/51 

cga teg ggt ggt ttc cgc tec ate age ccg 
arg ser gly gly phe arg ser ile ser pro 
211/71 

gca gga cgc gcc gtt ggt gtt tec etc tgt 
ala gly arg ala val gly val ser leu cys 
271/91 

caa cgt tgg act ggc cgc agt ggc gat gtt 
gin arg trp thr gly arg ser gly asp val 
331/111 

egg gat gtt ctt ggg tct egg gtt get get 
arg asp val leu gly ser arg val ala ala 
391/131 

ttc ggc cga gtc gat cac cgc caa aga gca 
phe gly arg val asp his arg gin arg ala 
451/151 

ggc ate gcg act ggc gat tat cac cat cct 
gly ile ala thr gly asp tyr his his pro 



SEQ ID No,17C' 



FIGURE 17C' 
REPLACEMENT SHEET (RULE 26) 



61/185 

sequence Rvl303 predicted by Cole et al. (Nature 393:537-544) and partially containing 
SeqlTA' 

1/1 31/11 

atg acg aca cca gcg cag gac gcg ccg ttg gtg ttt ccc tct gtt get ttc cgt ccg gtt 
met thr thr pro ala gin asp ala pro leu val phe pro ser val ala phe arg pro val 
61/21 91/31 

cgc ctt ttt ttc ate aac gtt gga ctg gcc gca gtg gcg atg ttg gtc gcc ggc gtg ttc 
arg leu phe phe ile asn val gly leu ala ala val ala met leu val ala gly val phe 
121/41 151/51 

ggt cac ctg acg gtc ggg atg ttc ttg ggt etc ggg ttg ctg ctg ggt ttg etc aat gcc 
alv his leu thr val gly met phe leu gly leu gly leu leu leu gly leu leu asn ala 
181/61 211/71 

ctg ctg gtg egg cgt teg gcc gag teg ate ace gcc aaa gag cac ccg tta aaa egg teg 
leu leu val arg arg ser ala glu ser ile thr ala lys glu his pro leu lys arg ser 
241/81 271/91 

atg gee etc aac teg gca teg cga ctg gcg att ate ace ate etc ggg ctg ate ate gcc 
met ala leu asn ser ala ser arg leu ala ile ile thr ile leu gly leu ile ile ala 
301/101 331/111 

tac att ttc egg ccc get gga ttg ggc gtc gtg ttc ggg ctg gca ttc ttc cag gtg ctg 
tyr ile phe arg pro ala gly leu gly val val phe gly leu ala phe phe gin val leu 
361/121 391/131 

ctg gtg gca acg acg gee ctg ccg gtc ctg aag aag ctg cgc act gcg ace gag gaa ccg 
leu val ala thr thr ala leu pro val leu lys lys leu arg thr ala thr glu glu pro 
421/141 451/151 

gtc gca act tat tct tec aat ggc cag acc ggg gga teg gaa gga agg age gcc age gat 
val ala thr tyr ser ser asn gly gin thr gly gly ser glu gly arg ser ala ser asp . 
481/161 
gac tga 
asp OPA 

SEQ ID No,17D 



FIGURE 17D 



Orf according to Cole et al. (Nature 393:537-544) and containing Rvl303 
1/1 31/11 

tga ggt gcc gca gtg acg aca cca gcg cag gac gcg ccg ttg gtg ttt ccc tct gtt get 
OPA gly ala ala val thr thr pro ala gin asp ala pro leu val phe pro ser val ala 
61/21 91/31 

ttc cgt ccg gtt cgc ctt ttt ttc ate aac gtt gga ctg gcc gca gtg gcg atg ttg gtc 
phe arg pro val arg leu phe phe ile asn val gly leu ala ala val ala met leu val 
121/41 151/51 

gee ggc gtg ttc ggt cac ctg acg gtc ggg atg ttc ttg ggt etc ggg ttg ctg ctg ggt 
ala gly val phe gly his leu thr val gly met phe leu gly leu gly leu leu leu gly 
181/61 211/71 

ttg etc aat gee ctg ctg gtg egg cgt teg gcc gag teg ate acc gcc aaa gag cac ccg 
leu leu asn ala leu leu val arg arg ser ala glu ser ile thr ala lys glu hxs pro 
241/81 271/91 

tta aaa egg teg atg gcc etc aac teg gca teg cga ctg gcg att ate acc ate etc ggg 
leu lys arg ser met ala leu asn ser ala ser arg leu ala ile ile thr ile leu gly 
301/101 331/111 

ctg ate ate gcc tac att ttc egg ccc get gga ttg ggc gtc gtg ttc ggg ctg gca ttc 
leu ile ile ala tyr ile phe arg pro ala gly leu gly val val phe gly leu ala phe 
361/121 391/131 

ttc cag gtg ctg ctg gtg gca acg acg gee ctg ccg gtc ctg aag aag ctg cgc act gcg 
phe gin val leu leu val ala thr thr ala leu pro val leu lys lys leu arg thr ala 
421/141 451/151 

acc gag gaa ccg gtc gca act tat tct tec aat ggc cag ace ggg gga teg gaa gga agg 
thr glu glu pro val ala thr tyr ser ser asn gly gin thr gly gly ser glu gly arg 
481/161 

age gcc age gat gac tga 
ser ala ser asp asp OPA 

SEQ ID NO-17F 



FIGURE 17F 



REPLACEMENT SHEET (RULE 26) 



# 



62/185 



31/11 

GTC GAA CAG GTA CGG AAG GCG CCG TCG GTC GCT CGG TCC GOT GGT ATC TCG TGT TCA GCC 
val glu gin val arg lys ala pro ser val ala arg ser ala gly ile ser cys ser ala 
61/21 91/31 

AGC CAG CGG CCG TTA ACG TGG CCG AAC AGG TCG TCT TGG GGT CGG GCA TCA GCG TCG ATG 
ser gin arg pro leu thr trp pro asn arg ser ser trp gly arg ala ser ala ser met 
121/41 151/51 

TGG CTC AGG TCG ATA CCC GAG GGG ATG GCA AGT GTC ACC CCG CCA TCC TTC CAC CTC TTT 
trp leu arg ser ile pro glu gly met ala ser val thr pro pro ser phe his leu phe 
181/61 211/71 

TCG GGT GCA ACG ATC GGG CCA TGC CTG ACG GGG AGC AGA GCC AGC CAC CGG CCC AAG AAG 
ser gly ala thr ile gly pro cys leu thr gly ser arg ala ser his arg pro lys lys 
241/81 271/91 

ATG CGG AAG ACG ACT CGC GGC CCG ACG CCG CGG AGG CCG CCG CGG CCG AAC CCA AAT CAT 
met arg lys thr thr arg gly pro thr pro arg arg pro pro arg pro asn pro asn his 
301/101 331/111 

CAG CCG GTC CCG ATG TTC TCG ACC TAC GGT ATC GCC TCG ACA CTA CTC GGC GTG CTA TCG 
gin pro val pro met phe ser thr tyr gly ile ala ser thr leu leu gly val leu ser 
361/121 

GTC GCC GCG GTC GTG CTG GGT GCG ATG ATC 
val ala ala val val leu gly ala met ile 



SEQ ID N0.18A 
FIGURE ISA 



TCG 7\AC AGG TAC GGA AGG CGC CGT CGG TCG 
ser asn arg tyr gly arg arg arg arg ser 
62/21 

GCC AGC GGC CGT TAA CGT GGC CGA ACA GGT 
ala ser gly arg OCH arg gly arg thr gly 
122/41 

GGC TCA GGT CGA TAC CCG AGG GGA TGG CAA 
gly ser gly arg tyr pro arg gly trp gin 
182/61 

CGG GTG CAA CGA TCG GGC CAT GCC TGA CGG 
arg val gin arg ser gly his ala OPA arg 
242/81 

TGC GGA AGA CGA CTC GCG GCC CGA CGC CGC 
cys gly arg arg leu ala ala arg arg arg 
302/101 

AGC CGG TCC CGA TGT TCT CGA CCT ACG GTA 
ser arg ser arg cys ser arg pro thr val 
362/121 

TCG CCG CGG TCG TGC TGG GTG CGA TGA TC 
ser pro arg ser cys trp val arg OPA 



32/11 

CTC GGT CCG CTG GTA TCT CGT GTT CAG CCA 
leu gly pro leu val ser arg val gin pro 
92/31 

CGT CTT GGG GTC GGG CAT CAG CGT CGA TGT 
arg leu gly val gly his gin arg arg cys 
152/51 

GTG TCA CCC CGC CAT CCT TCC ACC TCT TTT 
val ser pro arg his pro ser thr ser phe 
212/71 

GGA GCA GAG CCA GCC ACC GGC CCA AGA AGA 
gly ala glu pro ala thr gly pro arg arg 
272/91 

GGA GGC CGC CGC GGC CGA ACC CAA ATC ATC 
gly gly arg arg gly arg thr gin ile ile 
332/111 

TCG CCT CGA CAC TAC TCG GCG TGC TAT CGG 
ser pro arg his tyr ser ala cys tyr arg 



SEQ ID N0.18B 
FIGURE 18B 



REPLACEMENT SHEET (RULE 26) 



63/185 



3/1 

CGA ACA GGT ACG 
arg thr gly thr 
63/21 

CCA GCG GCC GTT 
pro ala ala val 
123/41 

GCT CAG GTC GAT 
ala gin val asp 

183/61 

GGG TGC AAC GAT 
gly cys asn asp 
243/81 

GCG GAA GAC GAC 
ala glu asp asp 
303/101 

GCC GGT CCC GAT 
ala gly pro asp 
363/121 

CGC CGC GGT CGT 
arg arg gly arg 



GAA GGC GCC GTC 
glu gly ala val 

AAC GTG GCC GAA 
asn val ala glu 

ACC CGA GGG GAT 
thr arg gly asp 

CGG GCC ATG CCT 
arg ala met pro 

TCG CGG CCC GAC 
ser arg pro asp 

GTT CTC GAC CTA 
val leu asp leu 

GCT GGG TGC GAT 
ala gly cys asp 



33/11 
GGT CGC TCG GTC 
gly arg ser val 

93/31 
CAG GTC GTC TTG 
gin val val leu 

153/51 
GGC AAG TGT CAC 
gly lys cys his 

213/71 
GAC GGG GAG CAG 
asp gly glu gin 

273/91 
GCC GCG GAG GCC 
ala ala glu ala 

333/111 
CGG TAT CGC CTC 
arg tyr arg leu 

GAT C 
asp 



CGC TGG TAT CTC 
arg trp tyr leu 

GGG TCG GGC ATC 
gly ser gly ile 

CCC GCC ATC CTT 
pro ala ile leu 

AGC CAG CCA CCG 
ser gin pro pro 

GCC GCG GCC GAA 
ala ala ala glu 

GAC ACT ACT CGG 
asp thr thr arg 



GTG TTC AGC CAG 
val phe ser gin 

AGC GTC GAT GTG 
ser val asp val 

CCA CCT CTT TTC 
pro pro leu phe 

GCC CAA GAA GAT 
ala gin glu asp 

CCC AAA TCA TCA 
pro lys ser ser 

CGT GCT ATC GGT 
arg ala ile gly 



SEQ ID N0.18C 



FIGURE IBC 



part of th e nucleotide sequence of seqlSA 
fn 31/11 
GAA GGC GCC GTC GGT 
glu gly ala val gly 
61/21 

AAC GTG GCC GAA CAG 
asn val ala glu gin 
121/41 

ACC CGA GGG GAT GGC 
thr arg gly asp gly 

181/61 

CGG GCC ATG CCT GAC 
arg ala met pro asp 
241/81 

TCG CGG CCC GAC GCC 
ser arg pro asp ala 
301/101 

TTC TCG ACC TAC GGT 
phe ser thr tyr gly 
361/121 

CTG GGT GCG ATG ATC 
leu gly ala met ile 



CGC 


TCG 


GTC 


CGC 


TGG 


TAT CTC 


GTG 


TTC 


AGC 


CAG 


CCA 


GCG 


GCC 


GTT 


arg 


ser 


val 


arg 


trp 


tyr leu 


val 


phe 


ser 


gin 


pro 


ala 


ala 


val 










91/31 


















GTC 


GTC 


TTG 


GGG 


TCG 


GGC ATC 


AGC 


GTC 


GAT 


GTG 


GCT 


CAG 


GTC 


GAT 


val 


val 


leu 


gly 


ser 


gly ile 
151/51 


ser 


val 


asp 


val 


ala 


gin 


val 


asp 


AAG 


TGT 


CAC 


CCC 


GCC 


ATC CTT 


CCA 


CCT 


CTT 


TTC 


GGG 


TGC 


AAC 


GAT 


lys 


cys 


his 


pro 


ala 


ile leu 


pro 


pro 


leu 


phe 


gly 


cys 


asn 


asp 








211/71 


















GGG 


GAG 


CAG 


AGC 


CAG 


CCA CCG 


GCC 


CAA 


GAA 


GAT 


GCG 


GAA 


GAC 


GAC 


gly 


glu 


gin 


ser 


gin 


pro pro 


ala 


gin 


glu 


asp 


ala 


glu 


asp 


asp 










271/91 


















GCG 


GAG 


GCC 


GCC 


GCG 


GCC GAA 


CCC 


AAA 


TCA 


TCA 


GCC 


GGT 


CCG 


ATG 


ala 


glu 


ala 


ala 


ala 


ala glu pro 


lys 


ser 


ser 


ala 


gly pro 


met 










331/111 


















ATC 


GCC 


TCG 


ACA 


CTA 


CTC GGC 


GTG 


CTA 


TCG 


GTC 


GCC 


GCG 


GTC 


GTG 


ile 


ala 


ser 


thr 


leu 


leu gly 


val 


leu 


ser 


val 


ala 


ala 


val 


val 



SEQ ID No,18A» 



FIGURE 18A' 



REPLACEMENT SHEET (RULE 26) 



64/185 



1/1 


















31/11 


















CGG AAG 


GCG 


CCG 


TCG 


GTC 


GCT 


CGG 


TCC 


GCT 


GGT ATC 


TCG 


TGT 


TCA 


GCC 


AGC 


CAG 


CGG 


CCG 


arg lys 


ala 


pro 


ser 


val 


ala 


arg 


ser 


ala 


gly ile 


ser 


cys 


ser 


ala 


ser 


gin 


arg 


pro 


61/21 


















91/31 


















TTA ACG 


TGG 


CCG 


AAC 


AGG 


TCG 


TCT 


TGG 


GGT 


CGG GCA 


TCA 


GCG 


TCG 


ATG 


TGG 


CTC 


AGG 


TCG 


leu thr 


trp 


pro 


asn 


arg 


ser 


ser 


trp 


gly 


arg ala 


ser 


ala 


ser 


met 


trp 


leu 


arg 


ser 


121/41 


















151/51 


















ATA CCC 


GAG 


GGG 


ATG 


GCA 


AGT 


GTC 


ACC 


CCG 


CCA TCC 


TTC 


CAC 


CTC 


TTT 


TCG 


GGT 


GCA 


ACG 


ile pro 


glu 


gly 


met 


ala 


ser 


val 


thr 


pro 


pro ser 


phe 


his 


leu 


phe 


ser 


gly 


ala 


thr 


181/61 


















211/71 


















ATC GGG 


CCA 


TGC 


CTG 


ACG 


GGG 


AGC 


AGA 


GCC 


AGC CAC 


CGG 


CCC 


AAG 


AAG 


ATG 


CGG 


AAG 


ACG 


ile gly pro 


cys 


leu 


thr 


gly 


ser 


arg 


ala 


ser his 


arg 


pro 


lys 


lys 


met 


arg 


lys 


thr 


241/81 


















271/91 


















ACT CGC 


GGC 


CCG 


ACG 


CCG 


CGG 


AGG 


CCG 


CCG 


CGG CCG 


AAC 


CCA 


AAT 


CAT 


CAG 


CCG 


GTC 


CGA 


thr arg 


gly 


pro 


thr 


pro 


arg 


arg 


pro 


pro 


arg pro 


asn 


pro 


asn 


his 


gin 


pro 


val 


arg 


301/101 


















331/111 


















TGT TCT 


CGA 


CCT 


ACG 


GTA 


TCG 


CCT 


CGA 


CAC 


TAC TCG 


GCG 


TGC 


TAT 


CGG 


TCG 


CCG 


CGG 


TCG 


cys ser 


arg 


pro 


thr 


val 


ser 


pro 


arg 


his 


tyr ser 


ala 


cys 


tyr 


arg 


ser 


pro 


arg 


ser 



361/121 

TGC TGG GTG CGA TGA TC 
cys trp val arg OPA 



SEQ ID No,18B^ 
FIGURE 18B' 



1/1 


















31/11 


















GGA AGG 


CGC 


CGT 


CGG 


TCG 


CTC 


GGT 


CCG 


CTG 


GTA TCT 


CGT 


GTT 


CAG 


CCA 


GCC 


AGC 


GGC 


CGT 


gly arg 


arg 


arg 


arg 


ser 


leu 


gly 


pro 


leu 


val ser 


arg 


val 


gin 


pro 


ala 


ser 


gly 


arg 


61/21 


















91/31 


















TAA CGT 


GGC 


CGA 


ACA 


GGT 


CGT 


CTT 


GGG 


GTC 


GGG CAT 


CAG 


CGT 


CGA 


TGT 


GGC 


TCA 


GGT 


CGA 


OCH arg 


gly 


arg 


thr 


gly 


arg 


leu 


gly 


val 


gly his 


gin 


arg 


arg 


cys 


gly 


ser 


gly 


arg 


121/41 


















151/51 


















TAC CCG 


AGG 


GGA 


TGG 


CAA 


GTG 


TCA 


CCC 


CGC 


CAT CCT 


TCC 


ACC 


TCT 


TTT 


CGG 


GTG 


CAA 


CGA 


tyr pro 


arg 


gly 


trp 


gin 


val 


ser 


pro 


arg 


his pro 


ser 


thr 


ser 


phe 


arg 


val 


gin 


arg 


181/61 


















211/71 


















TCG GGC 


CAT 


GCC 


TGA 


CGG 


GGA 


GCA 


GAG 


CCA 


GCC ACC 


GGC 


CCA 


AGA 


AGA 


TGC 


GGA 


AGA 


CGA 


sor gly 


his 


ala 


OPA 


arg 


gly 


ala 


glu 


pro 


ala thr 


gly 


pro 


arg 


arg 


cys 


gly 


arg 


arg 


241/81 


















271/91 


















CTC GCG 


GCC 


CGA 


CGC 


CGC 


GGA 


GGC 


CGC 


CGC 


GGC CGA 


ACC 


CAA 


ATC 


ATC 


AGC 


CGG 


TCC 


GAT 


leu ala 


ala 


arg 


arg 


arg 


gly 


gly 


arg 


arg 


gly arg 


thr 


gin 


ile 


ile 


ser 


arg 


ser 


asp 


301/101 


















331/111 


















GTT CTC 


GAC 


CTA 


CGG 


TAT 


CGC 


CTC 


GAC 


ACT 


ACT CGG 


CGT 


GCT 


ATC 


GGT 


CGC 


CGC 


GGT 


CGT 


val leu 


asp 


leu 


arg 


tyr 


arg 


leu 


asp 


thr 


thr arg 


arg 


ala 


ile 


gly 


arg 


arg 


gly 


arg 



361/121 

GCT GGG TGC GAT GAT C 
ala gly cys asp asp 

SEQ ID N0.18C' 
FIGURE IBC* 



REPLACEMENT SHEET (RULE 26) 



65/185 



sequence Rv0199 predicted by Cole et al . (Nature 393:537-544) and containing 
seqlSA^ 

1/1 31/11 

atg act gac ggg gag cag age cag cca cog gcc caa gaa gat gcg gaa gac gac teg egg 
Met pro asp gly glu gin ser gin pro pro ala gin glu asp ala glu asp asp ser arg 
61/21 91/31 

ccc gac gcc gcg gag gcc gcc gcg gcc gaa ccc aaa tea tea gcc ggt ccg atg ttc teg 
pro asp ala ala glu ala ala ala ala glu pro lys ser ser ala gly pro met phe ser 
121/41 151/51 

acc tac ggt ate gcc teg aca eta etc ggc gtg eta teg gtc gee gcg gtc gtg ctg ggt 
thr tyr gly ile ala ser thr leu leu gly val leu ser val ala ala val val leu gly 
181/61 211/71 

gcg atg ate tgg tec gca cac cgc gat gac tee ggc gag egt acc tac ctg acc egg gtc 
ala met ile trp ser ala his arg asp asp ser gly glu arg thr tyr leu thr arg val 
241/81 271/91 

atg ctg acc gcc get gaa tgg aeg gcc gtg ctg ate aac atg aac gcc gac aac ate gat 
met leu thr ala ala glu trp thr ala val leu ile asn met asn ala asp asn ile asp 
301/101 331/111 

gee age ctg cag cga ctg cac gac gga acg gtc ggt caa etc aac acc gac ttc gac get 
ala ser leu gin arg leu his asp gly thr val gly gin leu asn thr asp phe asp ala 
361/121 391/131 

gtc gtg cag ecc tac egg cag gtg gtg gag aag ttg egg acg cac age age ggc agg ate 
val val gin pro tyr arg gin val val glu lys leu arg thr his ser ser gly arg ile 
421/141 451/151 ^ 

gag gcg gta gcg ate gat acg gtg cac cgc gag ctg gat acc cag tec ggt gcc gcc cga 
glu ala val ala ile asp thr val his arg glu leu asp thr gin ser gly ala ala arg 
481/161 511/171 

ccg gta gta acc aeg aaa ttg cca ccg ttt gee act cgc acc gac teg gtg ctg ctg gtc 
pro val val thr thr lys leu pro pro phe ala thr arg thr asp ser val leu leu val 
541/181 571/191 

gcg acg teg gtc agt gag aac gcc ggc gcc aaa ccc cag acc gtg cac tgg aac ttg egg 
ala thr ser val ser glu asn ala gly ala lys pro gin thr val his trp asn leu arg 
601/201 631/211 

etc gat gtc tee gat gtg gac ggc aag ctg atg ate tec egg ttg gag teg att cga tga 
leu asp val ser asp val asp gly lys leu met ile ser arg leu glu ser ile arg OPA 



SEQ ID No.lSD 
FIGURE 18D 



REPLACEMENT SHEET (RULE 26) 



66/185 

ORF according to Cole et al . (Nature 393:537-544) and containing Rv0199 
1/1 31/11 

taa tec gat gcc gga ttg ggt gaa atg cac caa gta acg ggt cga gtc ttt gga ate ggt 
OCH ser asp ala gly leu gly glu met his gin val thr gly arg val phe gly ile gly 
61/21 91/31 

ate gae ata gac tec gat gcc gcc gcc cac gcc ggc acg ttg cag agt gcc aag ggc ggc 
ile asp ile asp ser asp ala ala ala his ala gly thr leu gin ser ala lys gly gly 
121/41 151/51 

ggc caa ttc ggt ggc gtc ggc cgc get gtc aat cgt ggc caa ttc gtc gtg cag egg ttg 
gly gin phe gly gly val gly arg ala val asn arg gly gin phe val val gin arg leu 
181/61 211/71 

cac ecc tgc gcg etc gac ggc ttc etc gtc gag gaa get ggc gta gag gtc gcc gat gcg 
his pro cys ala leu asp gly phe leu val glu glu ala gly val glu val ala asp ala 
241/81 271/91 

ctg cge ate ggt gee tac cgc age ace tgc ttg get ggc etg gat gat eag gtc teg eac 
leu arg ile gly ala tyr arg ser thr cys leu ala gly leu asp asp gin val ser his 
301/101 331/111 

ttg tgt etc ggc gcg gtc gaa cag get acg gaa ggc gcc gtc ggt cgc teg gtc cge tgg 
leu cys leu gly ala val glu gin ala thr glu gly ala val gly arg ser val arg trp 
361/121 391/131 

tat etc gtg ttc age eag eca gcg gee gtt aac gtg gee gaa eag gtc gtc ttg ggg teg 
tyr leu val phe ser gin pro ala ala val asn val ala glu gin val val leu gly ser 
421/141 451/151 

ggc ate age gtc gat gtg get cag gtc gat ace cga ggg gat ggc aag tgt cac ecc gee 
gly ile ser val asp val ala gin val asp thr arg gly asp gly lys cys his pro ala 
481/161 511/171 

ate ctt cea cet ett ttc ggg tgc aac gat egg gee atg cet gac ggg gag cag age eag 
ile leu pro pro leu phe gly cys asn asp arg ala met pro asp gly glu gin ser gin 
541/181 571/191 

cea ecg gcc caa gaa gat gcg gaa gac gac teg egg ecc gac gcc gcg gag gcc gcc gcg 
pro pro ala gin glu asp ala glu asp asp ser arg pro asp ala ala glu ala ala ala 
601/201 631/211 

gcc gaa ecc aaa tea tea gee ggt ccg atg ttc teg ace tac ggt ate gcc teg aca eta 
ala glu pro lys ser ser ala gly pro met phe ser thr tyr gly ile ala ser thr leu 
661/221 691/231 

etc ggc gtg eta teg gtc gee gcg gtc gtg etg ggt gcg atg ate tgg tec gea cac cgc 
leu gly val leu ser val ala ala val val leu gly ala met ile trp ser ala his arg 
721/241 751/251 

gat gae tec ggc gag cgt ace tac etg ace egg gtc atg etg ace gee get gaa tgg acg 
asp asp ser gly glu arg thr tyr leu thr arg val met leu thr ala ala glu trp thr 
781/261 811/271 

gcc gtg ctg ate aac atg aac gee gae aac ate gat gcc age ctg cag cga etg cac gac 
ala val leu ile asn met asn ala asp asn ile asp ala ser leu gin arg leu his asp 
841/281 871/291 

gga acg gtc ggt caa etc aac ace gac ttc gac get gtc gtg cag ecc tac egg eag gtg 
gly thr val gly gin leu asn thr asp phe asp ala val val gin pro tyr arg gin val 
901/301 931/311 

gtg gag aag ttg egg acg cac age age ggc agg ate gag gcg gta gcg ate gat acg gtg 
val glu lys leu arg thr his ser ser gly arg ile glu ala val ala ile asp thr val 
961/321 991/331 

cac cgc gag etg gat acc cag tec ggt gee gcc cga ccg gta gta ace acg aaa ttg cea 
his arg glu leu asp thr gin ser gly ala ala arg pro val val thr thr lys leu pro 
1021/341 1051/351 

ccg ttt gcc act cgc acc gae teg gtg ctg ctg gtc gcg acg teg gtc agt gag aac gee 
pro phe ala thr arg thr asp ser val leu leu val ala thr ser val ser glu asn ala 
1081/361 1111/371 

ggc gcc aaa ecc cag acc gtg cac tgg aac ttg egg etc gat gtc tec gat gtg gac ggc 
alv ala lys pro gin thr val his trp asn leu arg leu asp val ser asp val asp gly 
1141/381 1171/391 
aag ctg atg ate tec egg ttg gag teg att cga tga 
lys leu met lie ser arg leu glu ser ile arg OPA 

SEQ ID N0.18F 



FIGURE 18F 



REPLACEMENT SHEET (RULE 26) 



67/185 



31/11 



GTT GCG 


CAA 


CGG 


GGT 


GAG 


CAC 


CGA 


CGC 


GAT 


GAT GGC 


GCA 


ACT 


ATC 


GAA 


ACT 


GCA 


GGA 


CAT 


val ala 


gin 


arg 


gly 


glu 


his 


arg 


arg 


asp 


asp gly 


ala 


thr 


ile 


glu 


thr 


ala 


gly 


his 


61/21 


















91/31 


















CGC CAA 


CGC 


CAA 


CGA 


CGG 


CAC 


TCG 


CGC 


GGT 


GGG CAC 


CCC 


TGG 


CTA 


TCA 


GGC 


CAG 


CGT 


CGA 


arg gin 


arg 


gin 


arg 


arg 


his 


ser 


arg 


gly 


gly his 


pro 


trp 


leu 


ser 


gly 


gin 


arg 


arg 


121/41 


















151/51 


















CTA TGT 


GGT 


AAA 


CAC 


ACT 


GCG 


CAA 


CAG 


CGG 


TTT TGA 


TGT 


GCA 


AAC 


CCC 


GGA 


GTT 


CTC 


CGC 


leu cys 


gly 


lys 


his 


thr 


ala 


gin 


gin 


arg 


phe OPA 


cys 


ala 


asn 


pro 


gly 


val 


leu 


arg 


181/61 


















211/71 


















TCG CGT 


GTT 


CAA 


GGC 


CGA 


AAA 


AGG 


GGT 


GGT 


GAC CCT 


CGG 


CGG 


CAA 


CAC 


CGT 


GGA 


GGC 


GAG 


ser arg 


val 


gin 


gly 


arg 


lys 


arg 


gly gly 


asp pro 


arg 


arg 


gin 


his 


arg 


gly 


gly 


glu 


241/81 


















271/91 


















GGC GCT 


CGA 


GTA 


CAG 


CCT 


CGG 


CAC 


ACC 


GCC 


GGA CGG 


GGT 


GAC 


GGG 


CCC 


GCT 


GGT 


GGC 


TGC 


gly ala 


arg 


val 


gin 


pro 


arg 


his 


thr 


ala 


gly arg 


gly 


asp 


gly 


pro 


ala 


gly 


gly 


cys 


301/101 


















331/111 


















CGC CGC 


CGA 


CGA 


CAG 


TCC 


GGG 


CTG 


CAG 


TCC 


GTC GGA 


CTA 


CGA 


CAG 


GCT 


GCC 


GGT 


GTC 


CGG 


pro arg 


arg 


arg 


gin 


ser 


gly 


leu 


gin 


ser 


val gly 


leu 


arg 


gin 


ala 


ala 


gly 


val 


arg 


O ox/ L^L 1 






































KjKu i 








TP 


























cys gj-y 




ala 




arg 










































SEQ ID 


No,19A 






























FIGURE 19A 




































32/11 


















TTG CGC 


AAC 


GGG 


GTG 


AGC 


ACC 


GAC 


GCG 


ATG 


ATG GCG 


CAA 


CTA 


TCG 


AAA 


CTG 


CAG 


GAC 


ATC 


leu arg 


asn 


gly 


val 


ser 


thr 


asp 


ala 


met 


met ala 


gin 


leu 


ser 


lys 


leu 


gin 


asp 


ile 


62/21 


















92/31 


















GCC AAC 


GCC 


AAC 


GAC 


GGC 


ACT 


CGC 


GCG 


GTG 


GGC ACC 


CCT 


GGC 


TAT 


CAG 


GCC 


AGC 


GTC 


GAC 


ala asn 


ala 


asn 


asp 


gly 


thr 


arg 


ala 


val 


gly thr 


pro 


gly 


tyr 


gin 


ala 


ser 


val 


asp 


122/41 


















152/51 


















TAT GTG 


GTA 


AAC 


ACA 


CTG 


CGC 


AAC 


AGC 


GGT 


TTT GAT 


GTG 


CAA 


ACC 


CCG 


GAG 


TTC 


TCC 


GCT 


tyr val 


val 


asn 


thr 


leu 


arg 


asn 


ser 


gly 


phe asp 


val 


gin 


thr 


pro 


glu 


phe 


ser 


ala 


182/61 


















212/71 


















CGC GTG 


TTC 


AAG 


GCC 


GAA 


AAA 


GGG 


GTG 


GTG 


ACC CTC 


GGC 


GGC 


AAC 


ACC 


GTG 


GAG 


GCG 


AGG 


arg val 


phe 


lys 


ala 


glu 


lys 


gly 


val 


val 


thr leu 


gly 


gly 


asn 


thr 


val 


glu 


ala 


arg 


242/81 


















272/91 


















GCG CTC 


GAG 


TAC 


AGC 


CTC 


GGC 


ACA 


CCG 


CCG 


GAC GGG 


GTG 


ACG 


GGC 


CCG 


CTG 


GTG 


GCT 


GCC 


ala leu 


glu 


tyr 


ser 


leu 


gly 


thr 


pro 


pro 


asp gly 


val 


thr 


gly 


pro 


leu 


val 


ala 


ala 


302/101 


















332/111 


















CCC GCC 


GAC 


GAC 


AGT 


CCG 


GGC 


TGC 


AGT 


CCG 


TCG GAC 


TAC 


GAC 


AGG 


CTG 


CCG 


GTG 


TCC 


GGT 


pro ala 


asp 


asp 


ser 


pro 


gly 


cys 


ser 


pro 


ser asp 


tyr 


asp 


arg 


leu 


pro 


val 


ser 


gly 



362/121 

GCG GTG GTG CTG GTA GAT C 
ala val val leu val asp 



SEQ ID N0.19B 
FIGURE 19B 



REPLACEMENT SHEET fRULE 26) 



68/185 



TGC GCA 


ACG 


GGG 


TGA 


GCA 


CCG 


cys ala 


thr 


gly 


OPA 


ala 


pro 


63/21 












CCA ACG 


CCA 


ACG 


ACG 


GCA 


CTC 


pro thr 


pro 


thr 


thr 


ala 


leu 


123/41 












ATG TGG 


TAA 


ACA 


CAC 


TGC 


GCA 


met trp 


OCH 


thr 


his 


cys 


ala 


183/61 












GCG TGT 


TCA 


AGG 


CCG 


AAA 


AAG 


ala cys 


ser 


arg 


pro 


lys 


lys 


243/81 












CGC TCG 


AGT 


ACA 


GCC 


TCG 


GCA 


arg ser 


ser 


thr 


ala 


ser 


ala 


303/101 












CCG CCG 


ACG 


ACA 


GTC 


CGG 


GCT 


pro pro 


thr 


thr 


val 


arg 


ala 


363/121 












CGG TGG 


TGC 


TGG 


TAG 


ATC 




arg trp 


cys 


trp 


AMB 


ile 











33/11 




ACG 


CGA 


TGA 


TGG CGC 


AAC 


thr 


arg 


OPA 


trp arg 


asn 








93/31 




GCG 


CGG 


TGG 


GCA CCC 


CTG 


ala 


arg 


trp 


ala pro 


leu 








153/51 




ACA 


GCG 


GTT 


TTG ATG 


TGC 


thr 


ala 


val 


leu met 


cys 








213/71 




GGG 


TGG 


TGA 


CCC TCG 


GCG 


gly 


trp 


OPA 


pro ser 


ala 








273/91 




CAC 


CGC 


CGG 


ACG GGG 


TGA 


his 


arg 


arg 


thr gly 


OPA 








333/111 




GCA 


GTC 


CGT 


CGG ACT 


ACG 


ala 


val 


arg 


arg thr 


thr 



SEQ ID N0.19C 
FIGURE 19C 



TAT CGA AAC TGC AGG ACA TCG 
tyr arg asn cys arg thr ser 

GCT ATC AGG CCA GCG TCG ACT 
ala ile arg pro ala ser thr 

AAA CCC CGG AGT TCT CCG CTC 
lys pro arg ser ser pro leu 

GCA ACA CCG TGG AGG CGA GGG 
ala thr pro trp arg arg gly 

CGG GCC CGC TGG TGG CTG CCC 
arg ala arg trp trp leu pro 

ACA GGC TGC CGG TGT CCG GTG 
thr gly cys arg cys pro val 



part of 


the 


nucleotide s 


equence 


of seql9A 


















1/1 


















31/11 


















CTA TCG 


AAA 


CTG 


CAG 


GAC 


ATC 


GCC 


AAC 


GCC 


AAC GAC 


GGC 


ACT 


CGC 


GCG 


GTG 


GGC 


ACC 


CCT 


leu ser 


lys 


leu 


gin 


asp 


ile 


ala 


asn 


ala 


asn asp 


gly 


thr 


arg 


ala 


val 


gly 


thr 


pro 


61/21 
















91/31 


















GGC TAT 


CAG 


GCC 


AGC 


GTC 


GAC 


TAT 


GTG 


GTA 


AAC ACA 


CTG 


CGC 


AAC 


AGC 


GGT 


TTT 


GAT 


GTG 


gly tyr 


gin 


ala 


ser 


val 


asp 


tyr 


val 


val 


asn thr 


leu 


arg 


asn 


ser 


gly phe 


asp 


val 


121/41 
















151/51 


















CAA ACC 


CCG 


GAG 


TTC 


TCC 


GCT 


CGC 


GTG 


TTC 


AAG GCC 


GAA 


AAA 


GGG 


GTG 


GTG 


ACC 


CTC 


GGC 


gin thr 


pro 


glu 


phe 


ser 


ala 


sj^g 


val 


phe 


lys ala 


glu 


lys 


gly 


val 


val 


thr 


leu 


gly 


181/61 
















211/71 


















GGC AAC 


ACC 


GTG 


GAG 


GCG 


AGG 


GCG 


CTC 


GAG 


TAC AGC 


CTC 


GGC 


ACA 


CCG 


CCG 


GAC 


GGG 


GTG 


gly asn 


thr 


val 


glu 


ala 


arg 


ala 


leu 


glu 


tyr ser 


leu gly 


thr 


pro 


pro 


asp 


gly 


val 


241/81 


















271/91 
















TAC 


ACG GGC 


CCG 


CTG 


GTG 


GCT 


GCC 


CCC 


GCC 


GAC 


GAC AGT 


CCG 


GGC 


TGC 


AGT 


CCG 


TCG 


GAC 


thr gly 


pro 


leu 


val 


ala 


ala 


pro 


ala 


asp 


asp ser 


pro 


gly 


cys 


ser 


pro 


ser 


asp 


tyr 


301/101 
















331/111 


















GAC AGG 


CTG 


CCG 


GTG 


TCC 


GGT 


GCG 


GTG 


GTG 


CTG GTA 


GAT 


C 














asp arg 


leu 


pro 


val 


ser 


gly 


ala 


val 


val 


leu val 


asp 

















SEQ ID N0.19A' 



FIGURE 19A 



REPLACEMENT SHEET (RULE 26) 



69/185 



1/1 


















31/11 


















TAT CGA 


AAC 


TGC 


AGG 


ACA 


TCG 


CCA 


ACG 


CCA 


ACG ACG 


GCA 


CTC 


GCG 


CGG 


TGG 


GCA 


CCC 


CTG 


tyr arg 


asn 


cys 


arg 


thr 


ser 


pro 


thr 


pro 


thr thr 


ala 


leu 


ala 


arg 


trp 


ala 


pro 


leu 


61/21 


















91/31 


















GCT ATC 


AGG 


CCA 


GCG 


TCG 


ACT 


ATG 


TGG 


TAA 


ACA CAC 


TGC 


GCA 


ACA 


GCG 


GTT 


TTG 


ATG 


TGC 


ala ile 


arg 


pro 


ala 


ser 


thr 


met 


trp 


OCH 


thr his 


cys 


ala 


thr 


ala 


val 


leu 


met 


cys 


121/41 


















151/51 


















AAA CCC 


CGG 


AGT 


TCT 


CCG 


CTC 


GCG 


TGT 


TCA 


AGG CCG 


AAA 


AAG 


GGG 


TGG 


TGA 


CCC 


TCG 


GCG 


lys pro 


arg 


ser 


ser 


pro 


leu 


ala 


cys 


ser 


arg pro 


lys 


lys 


gly 


trp 


OPA 


pro 


ser 


ala 


181/61 


















211/71 


















GCA ACA 


CCG 


TGG 


AGG 


CGA 


GGG 


CGC 


TCG 


AGT 


ACA GCC 


TCG 


GCA 


CAC 


CGC 


CGG 


ACG 


GGG 


TGA 


ala thr 


pro 


trp 


arg 


arg 


gly 


arg 


ser 


ser 


thr ala 


ser 


ala 


his 


arg 


arg 


thr 


gly 


OPA 


241/81 
















271/91 


















CGG GCC 


CGC 


TGG 


TGG 


CTG 


CCC 


CCG 


CCG 


ACG 


ACA GTC 


CGG 


GCT 


GCA 


GTC 


CGT 


CGG 


ACT 


ACG 


arg ala 


arg 


trp 


trp 


leu 


pro 


pro 


pro 


thr 


thr val 


arg 


ala 


ala 


val 


arg 


arg 


thr 


thr 


301/101 


















331/111 


















ACA GGC 


TGC 


CGG 


TGT 


CCG 


GTG 


CGG 


TGG 


TGC 


TGG TAG 


ATC 
















thr gly 


cys 


arg 


cys 


pro 


val 


arg 


trp 


cys 


trp AMB 


ile 

















SEQ ID No,19B' 
FIGURE 19B' 



1/1 


















31/11 


















ATC GAA 


ACT 


GCA 


GGA 


CAT 


CGC 


CAA 


CGC 


CAA 


CGA CGG 


CAC 


TCG 


CGC 


GGT 


GGG 


CAC 


CCC 


TGG 


ile glu 


thr 


ala 


gly 


his 


arg 


gin 


arg 


gin 


arg arg 


his 


ser 


arg 


gly 


gly 


his 


pro 


trp 


61/21 


















91/31 


















CTA TCA 


GGC 


CAG 


CGT 


CGA 


CTA 


TGT 


GGT 


AAA 


CAC ACT 


GCG 


CAA 


CAG 


CGG 


TTT 


TGA 


TGT 


GCA 


leu ser 


gly 


gin 


arg 


arg 


leu 


cys 


gly 


lys 


his thr 


ala 


gin 


gin 


arg 


phe 


OPA 


cys 


ala 


121/41 


















151/51 


















AAC CCC 


GGA 


GTT 


CTC 


CGC 


TCG 


CGT 


GTT 


CAA 


GGC CGA 


AAA 


AGG 


GGT 


GGT 


GAC 


CCT 


CGG 


CGG 


asn pro 


gly 


val 


leu 


arg 


ser 


arg 


val 


gin 


gly arg 


lys 


arg 


gly 


gly 


asp 


pro 


arg 


arg 


181/61 


















211/71 


















CAA CAC 


CGT 


GGA 


GGC 


GAG 


GGC 


GCT 


CGA 


GTA 


CAG CCT 


CGG 


CAC 


ACC 


GCC 


GGA 


CGG 


GGT 


GAC 


gin his 


arg 


gly 


gly 


glu 


gly 


ala 


arg 


val 


gin pro 


arg 


his 


thr 


ala 


gly 


arg 


gly 


asp 


241/81 


















271/91 


















GGG CCC 


GCT 


GGT 


GGC 


TGC 


CCC 


CGC 


CGA 


CGA 


CAG TCC 


GGG 


CTG 


CAG 


TCC 


GTC 


GGA 


CTA 


CGA 


gly pro 


ala 


gly 


gly 


cys 


pro 


arg 


arg 


arg 


gin ser 


gly 


leu 


gin 


ser 


val 


gly 


leu 


arg 


301/101 


















331/111 


















CAG GCT 


GCC 


GGT 


GTC 


CGG 


TGC 


GGT 


GGT 


GCT 


GGT AGA 


TC 
















gin ala 


ala 


gly 


val 


arg 


cys 


gly 


gly 


ala 


gly arg 



















SEQ ID N0.19C' 
FIGURE 19C^ 



REPLACEMENT SHEET (RULE 26) 



70/185 



sequence Rv0418 predicted by Cole et al. (Nature 393:537-544) and containing 
seql9A' 



1/1 31/11 



atg gtg 


aac 


aaa 


tec 


agg 


atg 


atg 


ccg 


gcg 


gtg ctg 


gcc 


gtg 


get 


gtg 


gtc 


gtc 


gca 


ttc 


Met val 


asn 


lys 


ser 


arg 


met 


met 


pro 


ala 


val leu 


ala 


val 


ala 


val 


val 


val 


ala 


phe 


61/21 


















91/31 


















ctg acg 


acg 


ggc 


tgt 


ate 


egg 


tgg 


tet 


acg 


cag teg 


egg 


cec 


gtt 


gtt 


aac 


ggc 


cec 


get 


leu thr 


thr 


gly 


cys 


ile 


arg 


trp 


ser 


thr 


gin ser 


arg 


pro 


va± 


val 


asn 


gly 


pro 


ala 


121/41 


















151/51 


















gcc gca 


gag 


ttc 


gee 


gtt 


gcg 


ttg 


cge 


aac 


egg gtg 


age 


acc 


gac 


gcg 


atg 


atg 


gcg 


cac 


ala ala 


glu 


phe 


ala 


val 


ala 


leu 


arg 


asn 


arg val 


ser 


thr 


asp 


ala 


met 


met 


ala 


his 


181/61 


















^11/ / i 


















eta teg 


aaa 


ctg 


cag 


gac 


ate 


gee 


aac 


gcc 


aac gac 


ggc 


act 


cge 


gcg 


gtg 


ggc 


ace 


ect 


leu ser 


lys 


leu 


gin 


asp 


ile 


ala 


asn 


ala 


asn asp 


gly 


thr 


arg 


ala 


val 


gly 


thr 


pro 


241/81 


















271/91 


















ggc tat 


cag 


gcc 


age 


gtc 


gac 


tat 


gtg 


gta 


aac aca 


ctg 


cge 


aac 


age 


ggt 


ttt 


gat 


gtg 


gly tyr 


gin 


ala 


ser 


val 


asp 


tyr 


val 


val 


asn thr 


leu 


arg 


asn 


ser 


gly 


phe 


asp 


val 


301/101 


















331/111 


















caa acc 


ccg 


gag 


ttc 


tec 


get 


cge 


gtg 


ttc 


aag gcc 


gaa 


aaa 


ggg 


gtg 


gtg 


ace 


etc 


ggc 


gin thr 


pro 


glu 


phe 


ser 


ala 


arg 


val 


phe 


lys ala 


glu 


lys 


gly 


val 


val 


thr 


leu 


gly 


361/121 


















391/131 


















ggc aac 


acc 


gtg 


gag 


gcg 


agg 


gcg 


etc 


gag 


tac age 


etc 


ggc 


aca 


ccg 


ccg 


gac 


ggg 


gtg 


gly asn 


thr 


val 


glu 


ala 


arg 


ala 


leu 


glu 


tyr ser 


leu 


gly 


thr 


pro 


pro 


asp 


gly 


val 


421/141 


















451/151 


















acg ggc 


ccg 


ctg 


gtg 


get 


gee 


cec 


gcc 


gac 


gac agt 


ccg 


ggc 


tgc 


agt 


ccg 


teg 


gac 


tac 


thr gly 


pro 


leu 


val 


ala 


ala 


pro 


ala 


asp 


asp ser 


pro 


gly 


cys 


ser 


pro 


ser 


asp 


tyr 


481/161 


















511/171 


















gac agg 


ctg 


ccg 


gtg 


tec 


ggt 


gcg 


gtg 


gtg 


ctg gta 


gat 


cge 


ggc 


gtc 


tgt 


ect 


ttt 


gcc 


asp arg 


leu 


pro 


val 


ser 


gly 


ala 


val 


val 


leu val 


asp 


arg 


gly 


val 


cys 


pro 


pne 


ala 


541/181 


















571/191 


















cag aag 


gaa 


gac 


gca 


gee 


gcg 


cag 


cge 


ggt 


gcg gtg 


gcg 


ctg 


ate 


att 


get 


gac 


aac 


ate 


gin lys 


glu 


asp 


ala 


ala 


ala 


gin 


arg 


gly 


ala val 


ala 


leu 


ile 


ile 


ala 


asp 


asn 


ile 


601/201 


















631/211 


















gac gag 


cag 


gcg 


atg 


ggc 


ggc 


ace 


ctg 


ggg 


get aat 


acc 


gac 


gtc 


aag 


ate 


ccg 


gtg 


gtg 


asp glu 


gin 


ala 


met 


gly 


gly 


thr 


leu 


gly 


ala asn 


thr 


asp 


val 


lys 


ile 


pro 


val 


val 


661/221 


















691/231 


















agt gtc 


ace 


aag 


teg 


gtc 


gga 


ttc 


cag 


eta 


cge gga 


cag 


tet 


ggg 


eca 


ace 


ace 


gtc 


aag 


ser val 


thr 


lys 


ser 


val 


gly phe 


gin 


leu 


arg gly gin 


ser 


gly 


pro 


thr 


thr 


val 


lys 


721/241 


















751/251 


















etc acg 


gcg 


age 


acc 


caa 


agt 


ttc 


aag 


gcc 


cge aac 


gtc 


ate 


gcg 


cag 


acg 






ggg 


leu thr 


ala 


ser 


thr 


gin 


ser 


phe 


lys 


ala 


arg asn 


val 


ile 


ala 


gin 


thr 


lys 


thr 


gly 


781/261 


















811/271 


















teg teg 


gcc 


aac 


gtg 


gtg 


atg 


gca 


ggt 


gcg 


eat ttg 


gac 


age 


gtt 


ccg 


gaa 


gga 


cec 


ggc 


ser ser 


ala 


asn 


val 


val 


met 


ala 


gly 


ala 


his leu 


asp 


ser 


val 


pro 


glu 


gly 


pro 


gly 


841/281 


















871/291 


















ate aac 


gac 


aac 


ggc 


teg 


gga 


gtg 


get 


gcg 


gtt ctg 


gaa 


acg 


gca 


gtg 


cag 


ctg 


ggg 


aac 


ile asn 


asp 


asn 


gly 


ser 


gly val 


ala 


ala 


val leu 


glu thr 


ala 


val 


gin 


leu 


gly 


asn 


901/301 


















931/311 


















tea ccg 


cat 


gtg 


tee 


aac 


gcg 


gta 


egg 


ttc 


gee ttc 


tgg 


ggc 


gee 


gag 


gaa 


ttc 


ggc 


ctg 


ser pro 


his 


val 


ser 


asn 


ala 


val 


arg 


phe 


ala phe 


trp 


gly 


ala 


glu 


glu 


phe 


gly 


leu 


961/321 


















991/331 


















att ggg 


tea 


cga 


aac 


tac 


gtc 


gag 


teg 


ctg 


gac ate 


gac 


gcg 


etc 


aaa 


ggc 


ate 


gcg 


ctg 


ile gly 


ser 


arg 


asn 


tyr 


val 


glu 


ser 


leu 


asp lie 


asp 


ala 


leu 


lys 


gly 


ile 


ala 


leu 



SEQ ID No ,19 D 
FIGURE 19D 



REPLACEMENT SHEET (RULE 26) 



72/185 



ORF according to Cole et al . (Nature 393:537-544) and containing Rv0418 
1/1 31/11 

tag gcc att caa cgc tct gtt cgt ttg att ggt egg tgg gat gcg aaa get gcg egg cga 
AMB ala ile gin arg ser val arg leu ile gly arg trp asp ala lys ala ala arg arg 
61/21 91/31 

cag gcg egg tct aat ctg ggc gcg atg gtg aac aaa tec agg atg atg ccg gcg gtg ctg 
gin ala arg ser asn leu gly ala met val asn lys ser arg met met pro ala val leu 
121/41 151/51 

gcc gtg get gtg gtc gtc gca ttc ctg acg aeg ggc tgt ate egg tgg tct aeg cag teg 
ala val ala val val val ala phe leu thr thr gly cys ile arg trp ser thr gin ser 
181/61 211/71 

egg ccc gtt gtt aac ggc ccc get gcc gca gag ttc gcc gtt gcg ttg cgc aac egg gtg 
arg pro val val asn gly pro ala ala ala glu phe ala val ala leu arg asn arg val 
241/81 271/91 

age ace gac gcg atg atg gcg cac eta teg aaa ctg cag gac ate gcc aac gcc aac gac 
ser thr asp ala met met ala his leu ser lys leu gin asp ile ala asn ala asn asp 
301/101 331/111 

ggc act cgc gcg gtg ggc acc cet ggc tat cag gcc age gtc gac tat gtg gta aac aca 
gly thr arg ala val gly thr pro gly tyr gin ala ser val asp tyr val val asn thr 
361/121 391/131 

ctg cgc aac age ggt ttt gat gtg caa acc ccg gag ttc tee get cgc gtg ttc aag gcc 
leu arg asn ser gly phe asp val gin thr pro glu phe ser ala arg val phe lys ala 
421/141 451/151 

gaa aaa ggg gtg gtg ace etc ggc ggc aac acc gtg gag gcg agg gcg etc gag tac age 
glu lys gly val val thr leu gly gly asn thr val glu ala arg ala leu glu tyr ser 
481/161 511/171 

etc ggc aca ccg ccg gac ggg gtg acg ggc ccg ctg gtg get gcc ccc gee gac gac agt 
leu gly thr pro pro asp gly val thr gly pro leu val ala ala pro ala asp asp ser 
541/181 571/191 

ccg ggc tgc agt ccg teg gac tac gac agg ctg ccg gtg tec ggt gcg gtg gtg ctg gta 
pro gly cys ser pro ser asp tyr asp arg leu pro val ser gly ala val val leu val 
601/201 631/211 

gat cgc ggc gtc tgt cct ttt gcc cag aag gaa gac gca gcc gcg cag cgc ggt gcg gtg 
asp arg gly val cys pro phe ala gin lys glu asp ala ala ala gin arg gly ala val 
661/221 691/231 

gcg ctg ate att get gac aac ate gac gag cag gcg atg ggc ggc acc ctg ggg get aat 
ala leu ile ile ala asp asn ile asp glu gin ala met gly gly thr leu gly ala asn 
721/241 751/251 

acc gac gtc aag ate ccg gtg gtg agt gtc acc aag teg gtc gga ttc cag eta cgc gga 
thr asp val lys ile pro val val ser val thr lys ser val gly phe gin leu arg gly 
781/261 811/271 

cag tct ggg eca acc acc gtc aag etc acg gcg age acc caa agt ttc aag gcc cgc aac 
gin ser gly pro thr thr val lys leu thr ala ser thr gin ser phe lys ala arg asn 
841/281 871/291 

gtc ate gcg cag acg aag aeg ggg teg teg gcc aac gtg gtg atg gca ggt gcg cat ttg 
val ile ala gin thr lys thr gly ser ser ala asn val val met ala gly ala his leu 
901/301 931/311 

gac age gtt ccg gaa gga ccc ggc ate aac gac aac ggc teg gga gtg get gcg gtt ctg 
asp ser val pro glu gly pro gly ile asn asp asn gly ser gly val ala ala val leu 
961/321 991/331 

gaa acg gca gtg cag ctg ggg aac tea ccg cat gtg tec aac gcg gta egg ttc gcc ttc 
glu thr ala val gin leu gly asn ser pro his val ser asn ala val arg phe ala phe 



SEQ ID No, 19 F 



FIGURE 19F 



REPLACEMENT SHEET (RULE 26) 



73/185 



1021/341 

tgg ggc gcc gag gaa ttc ggc ctg att ggg 
trp gly ala glu glu phe gly leu lie gly 
1081/361 

gac gcg etc aaa ggc ate gcg ctg tat ctg 
asp ala leu lys gly lie ala leu tyr leu 
1141/381 

ggt tac ttc acc tac gac ggt gac cag teg 
gly tyr phe thr tyr asp gly asp gin ser 
1201/401 

gtg cec gaa ggc teg gcc ggt ate gag cgc 
val pro glu gly ser ala gly lie glu arg 
1261/421 

aag acc gcg cag gac acc teg ttc gac ggt 
lys thr ala gin asp thr ser phe asp gly 
1321/441 

ggt ate cet teg ggt ggc ctg ttc tec ggc 
gly lie pro ser gly gly leu phe ser gly 
1381/461 

gag etc tgg ggc ggc acc gcc gac gag cet 
glu leu trp gly gly thr ala asp glu pro 
1441/481 

acc ctg gac cat ate gac cgc acc gcg etc 
thr leu asp his lie asp arg thr ala leu 
1501/501 

gtg ggt ttg tat gcg cag gac etc ggc ggc 
val gly leu tyr ala gin asp leu gly gly 
1561/521 

acc cgc cac ctg att gcc aaa ccg tga 
thr arg his leu ile ala lys pro OPA 



1051/351 

tea cga aac tac gtc gag teg ctg gac ate 
ser arg asn tyr val glu ser leu asp ile 
1111/371 

aac ttc gac atg ttg gcg teg ccg aac ccg 
asn phe asp met leu ala ser pro asn pro 
1171/391 

ctg ccg eta gac gcc cgc ggt cag ccg gtg 
leu pro leu asp ala arg gly gin pro val 
1231/411 

acg ttc gtc gcc tat ctg aag atg gcc ggc 
thr phe val ala tyr leu lys met ala gly 
1291/431 

egg tec gac tac gac ggc ttc acg ctg gcg 
arg ser asp tyr asp gly phe thr leu ala 
1351/451 

get gag gtc aag aag tec gcc gag caa gee 
ala glu val lys lys ser ala glu gin ala 
1411/471 

ttc gat cec aac tat cac cag aag aca gac 
phe asp pro asn tyr his gin lys thr asp 
1471/491 

ggt ate aac ggc get ggc gtc gcg tac gcg 
gly ile asn gly ala gly val ala -tyr ala 
1531/511 

cec aac ggg gtt ccg gtc atg gcg gac cgc 
pro asn gly val pro val met ala asp arg 



SEQ ID N0.19F (continued) 
FIGURE 19F (continued) 



CGA GAC AGT GGT 
arg asp ser gly 

61/21 

CGC GGT GTG GGA 
arg gly val gly 
121/41 

GGC CGG CTG GCA 
gly arg leu ala 
181/61 

TGA GCT cec CGT 
OPA ala pro arg 
241/81 

GCA CAT GGT GCC 
ala his gly ala 
301/101 

TCG GTG CGT GCA 
ser val arg ala 
361/121 

CCG ACA AAT CCG 
pro thr asn pro 



GCG GGA CAC TTG 
ala gly his leu 

CTC ACG TTC GGT 
leu thr phe gly 

ACC GTC GGT TTC 
thr val gly phe 

TTT GCT GGA TGC 
phe ala gly cys 

GGC AGG GAG GAA 
gly arg glu glu 

TGC TCG CAG CCG 
cys ser gin pro 

GAC CAC TGC ATC 
asp his cys ile 



31/11 
AGT TCG GCT GCT 
ser ser ala ala 

91/31 
GAG GGT ACA GCG 
glu gly thr ala 

151/51 
GAC GTT GGT GAC 
asp val gly asp 

211/71 
CCA GGC ACC GCC 
pro gly thr ala 

271/91 
CAG TGG GCA AGC 
gin trp ala ser 

331/111 
GAT GCA CCA ACG 
asp ala pro thr 

AGG ATC 

arg ile 

SEQ ID No.20i 



AAC GAC GCC AGA 
asn asp ala arg 

GAC CTT CGA GCA 
asp leu arg ala 

GAC CCC TCG TTC 
asp pro ser phe 

GGT ACT GCT GCG 
gly thr ala ala 

AGC TAG CCG CGC 
ser ANB pro arg 

TGG TCG ACG GGA 
trp ser thr gly 



GTC GCC CGC TTC 
val ala arg phe 

CGC AAT ATC GTG 
arg asn ile val 

ATG AAT CGT TCT 
met asn arg ser 

CTT AAG CTT GTC 
leu lys leu val 

TCG CCG CGC TGG 
ser pro arg trp 

CCG CCG TGG CTG 
pro pro trp leu 



FIGURE 20A 
REPLACEMENT SHEET (RULE 26) 



74/185 



32/11 



GAG ACA 


GTG 


GTG 


CGG 


GAC 


ACT 


TGA 


GTT 


CGG 


CTG CTA 


ACG 


ACG 


CCA 


GAG 


TCG 


CCC 


GCT 


TCC 


glu thr 


val 


val 


arg 


asp 


thr 


OPA 


val 


arg 


leu leu 


thr 


thr 


pro 


glu 


ser 


pro 


ala 


ser 


62/21 


















92/31 


















GCG GTG 


TGG 


GAC 


TCA 


CGT 


TCG 


GTG 


AGG 


GTA 


CAG CGG 


ACC 


TTC 


GAG 


CAC 


GCA 


ATA 


TCG 


TGG 


ala val 


trp 


asp 


ser 


arg 


ser 


val 


arg 


val 


gin arg 


thr 


phe 


glu 


his 


ala 


ile 


ser 


trp 


122/41 


















152/51 


















GCC GGC 


TGG 


CAA 


CCG 


TCG 


GTT 


TCG 


ACG 


TTG 


GTG ACG 


ACC 


CCT 


CGT 


TCA 


TGA 


ATC 


GTT 


CTT 


ala gly 


trp 


gin 


pro 


ser 


val 


ser 


thr 


leu 


val thr 


thr 


pro 


arg 


ser 


OPA 


ile 


val 


leu 


182/61 


















212/71 


















GAG CTC 


CCC 


GTT 


TTG 


CTG 


GAT 


GCC 


CAG 


GCA 


CCG CCG 


GTA 


CTG 


CTG 


CGC 


TTA 


AGC 


TTG 


TCG 


glu leu 


pro 


val 


leu 


leu 


asp 


ala 


gin 


ala 


pro pro 


val 


leu 


leu 


arg 


leu 


ser 


leu 


ser 


242/81 
















272/91 


















CAC ATG 


GTG 


CCG 


GCA 


GGG 


AGG 


AAC 


AGT 


GGG 


CAA GCA 


GCT 


AGC 


CGC 


GCT 


CGC 


CGC 


GCT 


GGT 


his met 


val 


pro 


ala 


gly arg 


asn 


ser 


gly 


gin ala 


ala 


ser 


arg 


ala 


arg 


arg 


ala 


gly 


302/101 


















332/111 


















CGG TGC 


GTG 


CAT 


GCT 


CGC 


AGC 


CGG 


ATG 


CAC 


CAA CGT 


GGT 


CGA 


CGG 


GAC 


CGC 


CGT 


GGC 


TGC 


arg cys 


val 


his 


ala 


arg 


ser 


arg 


met 


his 


gin arg 


gly 


arg 


arg 


asp 


arg 


arg 


gly 


cys 


362/121 




































CGA CAA 


ATC 


CGG 


ACC 


ACT 


GCA 


TCA 


GGA 


TC 




















arg gin 


ile 


arg 


thr 


thr 


ala 


ser 


gly 























SEQ ID No. 2 OB 
FIGURE 20B 



AGA CAG TGG TGC GGG 
arg gin trp cys gly 
63/21 

CGG TGT GGG ACT CAC 
arg cys gly thr his 
123/41 

CCG GCT GGC AAC CGT 
pro ala gly asn arg 

183/61 

AGC TCC CCG TTT TGC 
ser ser pro phe cys 
243/81 

ACA TGG TGC CGG CAG 
thr trp cys arg gin 
303/101 

GGT GCG TGC ATG CTC 
gly ala cys met leu 

363/121 

GAC AAA TCC GGA CCA 
asp lys ser gly pro 



ACA 


CTT 


GAG 


TTC 


GGC 


thr 


leu 


glu phe 


gly 


GTT 


CGG 


TGA 


GGG 


TAC 


val 


arg 


OPA gly 


tyr 


CGG 


TTT 


CGA 


CGT 


TGG 


arg 


phe 


arg 


arg 


trp 


TGG 


ATG 


CCC 


AGG 


CAC 


trp 


met 


pro 


arg 


his 


GGA 


GGA 


ACA 


GTG 


GGC 


gly 


gly 


thr 


val 


gly 


GCA 


GCC 


GGA 


TGC 


ACC 


ala 


ala 


gly 


cys 


thr 


CTG 


CAT 


CAG 


GAT 


C 


leu 


his 


gin 


asp 





33/11 

TGC TAA CGA CGC CAG 
cys OCH arg arg gin 
93/31 

AGC GGA CCT TCG AGC 
ser gly pro ser ser 
153/51 

TGA CGA CCC CTC GTT 
OPA arg pro leu val 

213/71 

CGC CGG TAC TGC TGC 
arg arg tyr cys cys 
273/91 

AAG CAG CTA GCC GCG 
lys gin leu ala ala 
333/111 

AAC GTG GTC GAC GGG 
asn val val asp gly 



AGT CGC CCG CTT CCG 
ser arg pro leu pro 

ACG CAA TAT CGT GGG 
thr gin tyr arg gly 

CAT GAA TCG TTC TTG 
his glu ser phe leu 

GCT TAA GCT TGT CGC 
ala OCH ala cys arg 

CTC GCC GCG CTG GTC 
leu ala ala leu val 

ACC GCC GTG GCT GCC 
thr ala val ala ala 



SEQ ID NO.20C 
FIGURE 20C 
REPLACEMENT SHEET (RULE 26) 



75/185 



part of the nucleotide sequence of seq20A 



1/1 


















TGT GGG 


ACT 


CAC 


GTT 


CGG 


TGA 


GGG 


TAC 


AGC 


cys gly 


thr 


his 


val 


arg 


OPA 


gly 


tyr 


ser 


61/21 


















OCT GGC 


AAC 


CGT 


CGG 


TTT 


CGA 


CGT 


TGG 


TGA 


ala gly 


asn 


arg 


arg 


phe 


arg 


arg 


trp 


OPA 


121/41 


















TCC CCG 


TTT 


TGC 


TGG 


ATG 


CCC 


AGG 


CAC 


CGC 


ser pro 


phe 


cys 


trp 


met 


pro 


arg 


his 


arg 


181/61 


















TGG TGC 


CGG 


CAG 


GGA 


GGA 


ACA 


GTG 


GGC 


AAG 


trp cys 


arg 


gin 


gly 


gly 


thr 


val 


gly 


lys 


241/81 


















GCG TGC 


ATG 


CTC 


GCA 


GCC 


GGA 


TGC 


ACC 


AAC 


ala cys 


met 


leu 


ala 


ala 


gly 


cys 


thr 


asn 


301/101 


















AAA TCC 


GGA 


CCA 


CTG 


CAT 


CAG 


GAT 


C 




lys ser 


gly 


pro 


leu 


his 


gin 


asp 







31/11 

GGA CCT TCG AGC ACG CAA TAT CGT GGG CCG 
gly pro ser ser thr gin tyr arg gly pro 
91/31 

CGA CCC CTC GTT CAT GAA TCG TTC TTG AGC 
arg pro leu val his glu ser phe leu ser 
151/51 

CGG TAC TGC TGC GCT TAA GCT TGT CGC ACA 
arg tyr cys cys ala OCH ala cys arg thr 
211/71 

CAG CTA GCC GCG CTC GCC GCG CTG GTC GGT 
gin leu ala ala leu ala ala leu val gly 
271/91 

GTG GTC GAC GGG ACC GCC GTG GCT GCC GAC 
val val asp gly thr ala val ala ala asp 



SEQ ID NO.20A* 
FIGURE 2 OA' 



1/1 

GTG GGA CTC ACG 
val gly leu thr 
61/21 

CTG GCA ACC GTC 
leu ala thr val 
121/41 

CCC CGT TTT GCT 
pro arg phe ala 
181/61 

GGT GCC GGC AGG 
gly ala gly arg 
241/81 

CGT GCA TGC TCG 
arg ala cys ser 
301/101 

AAT CCG GAC CAC 
asn pro asp his 



TTC GGT GAG GGT 
phe gly glu gly 

GGT TTC GAC GTT 
gly phe asp val 

GGA TGC CCA GGC 
gly cys pro gly 

GAG GAA CAG TGG 
glu glu gin trp 

CAG CCG GAT GCA 
gin pro asp ala 

TGC ATC AGG ATC 
cys ile arg ile 







31/11 


ACA 


GCG 


GAC CTT 


thr 


ala 


asp leu 






91/31 


GGT 


GAC 


GAC CCC 


gly 


asp 


asp pro 






151/51 


ACC 


GCC 


GGT ACT 


thr 


ala 


gly thr 






211/71 


GCA 


AGC 


AGC TAG 


ala 


ser 


ser AMB 






271/91 


CCA 


ACG 


TGG TCG 


pro 


thr 


trp ser 



SEQ ID No,20B 
FIGURE 20B' 



CGA GCA CGC AAT 
arg ala arg asn 

TCG TTC ATG AAT 
ser phe met asn 

GCT GCG CTT AAG 
ala ala leu lys 

CCG CGC TCG CCG 
pro arg ser pro 

ACG GGA CCG CCG 
thr gly pro pro 



ATC GTG GGC CGG 

ile val gly arg 

CGT TCT TGA GCT 

arg ser OPA ala 

CTT GTC GCA CAT 

leu val ala his 

CGC TGG TCG GTG 

arg trp ser val 

TGG CTG CCG ACA 

trp leu pro thr 



REPLACEMENT SHEET (RULE 26) 



76/185 



1/1 31/11 



GTG TGG 


GAC 


TCA 


CGT 


TCG 


GTG 


AGG 


GTA 


CAG CGG ACC 


TTC 


GAG 


CAC 


GCA 


ATA 


TCG 


TGG 


GCC 


val trp 


asp 


ser 


arg 


ser 


val 


arg 


val 


gin arg thr 


phe 


glu 


his 


ala 


xle 


ser 


trp 


ala 


61/21 
















91/31 


















GGC TGG 


CAA 


CCG 


TCG 


GTT 


TCG 


ACG 


TTG 


GTG ACG ACC 


CCT 


CGT 


TCA 


TGA 


ATC 


GTT 


CTT 


GAG 


gly trp 


gin pro 


ser 


val 


ser 


thr 


leu 


val thr thr 


pro 


arg 


ser 


OPA 


ile 


val 


leu 


glu 


121/41 
















151/51 


















CTC CCC 


GTT 


TTG 


CTG 


GAT 


GCC 


CAG 


GCA 


CCG CCG GTA 


CTG 


CTG 


CGC 


TTA 


AGC 


TTG 


TCG 


CAC 


leu pro 


val 


leu 


leu 


asp 


ala 


gin 


ala 


pro pro val 


leu 


leu 


arg 


leu 


ser 


leu 


ser 


his 


181/61 
















211/71 


















ATG GTG 


CCG 


GCA 


GGG 


AGG 


AAC 


AGT 


GGG 


CAA GCA GCT 


AGC 


CGC 


GCT 


CGC 


CGC 


GCT 


GGT 


CGG 


met val 


pro 


ala 


gly 


arg 


asn 


ser 


gly 


gin ala ala 


ser 


arg 


ala 


arg 


arg 


ala 


gly 


arg 


241/81 














271/91 


















TGC GTG 


CAT 


GCT 


CGC 


AGC 


CGG 


ATG 


CAC 


CAA CGT GGT 


CGA 


CGG 


GAC 


CGC 


CGT 


GGC 


TGC 


CGA 


cys val 


his 


ala 


arg 


ser 


arg 


met 


his 


gin arg gly 


arg 


arg 


asp 


arg arg gly cys 


arg 


301/101 


































CAA ATC 


CGG 


ACC 


ACT 


GCA 


TCA 


GGA 


TC 




















gin ile 


arg 


thr 


thr 


ala 


ser 


gly 























SEQ ID No,20C' 



FIGURE 20C' 



sequence Rv3576 predicted by Cole et al. (Nature 393:537-544) and containing seq20A' 
1/1 31/11 

atg ggc aag cag eta gcc gcg etc gcc gcg ctg gtc ggt gcg tgc atg etc gca gcc gga 
met gly lys gin leu ala ala leu ala ala leu val gly ala cys met leu ala ala gly 
61/21 91/31 

tgc acc aac gtg gtc gac ggg acc gcc gtg get gee gae aaa tec gga cca etg cat cag 
cys thr asn val val asp gly thr ala val ala ala asp lys ser gly pro leu his gin 
121/41 151/51 

gat ccg ata ccg gtt tea gcg ctt gaa ggg ctg ctt etc gac ttg age cag ate aat gcc 
asp pro ile pro val ser ala leu glu gly leu leu leu asp leu ser gin ile asn ala 
181/61 211/71 

gcg ctg ggt gcg aca teg atg aag gtg tgg ttc aac gcc aag gca atg tgg gac tgg age 
ala leu gly ala thr ser met lys val trp phe asn ala lys ala met trp asp trp ser 
241/81 271/91 

aag age gtg gcc gac aag aat tgc ctg get ate gac ggt cca gca cag gaa aag gtc tat 
lys ser val ala asp lys asn cys leu ala ile asp gly pro ala gin glu lys val tyr 
301/101 331/111 

gcc ggc acc ggg tgg acc get atg cgc ggc caa egg ctg gat gac age ate gat gac tec 
ala gly thr gly trp thr ala met arg gly gin arg leu asp asp ser ile asp asp ser 
361/121 391/131 

aag aaa cgc gae cae tae gee att caa gcg gtc gtc ggc ttc ccg ace gca eat gat gcc 
lys lys arg asp his tyr ala ile gin ala val val gly phe pro thr ala his asp ala 
421/141 451/151 

gag gag ttc tae age tec teg gtg caa age tgg age age tgc teg aac cgc egg ttt gtc 
glu glu phe tyr ser ser ser val gin ser trp ser ser cys ser asn arg arg phe val 
481/161 511/171 

gaa gtc acc ccc gga cag gac gae gee gcc tgg act gtg get gac gtt gtc aac gac aac 
glu val thr pro gly gin asp asp ala ala trp thr val ala asp val val asn asp asn 
541/181 571/191 

ggc atg etc agt age teg cag gtt cag gaa ggc ggc gae gga tgg acc tgc cag cgt gcc 
gly met leu ser ser ser gin val gin glu gly gly asp gly trp thr cys gin arg ala 
601/201 631/211 

etg act gcg cgc aac aac gtc act ate gac att gtc acg tgc gee tat age caa ccg gat 
leu thr ala arg asn asn val thr ile asp ile val thr cys ala tyr ser gin pro asp 
661/221 691/231 

ttg gtg gcg att ggc ate get aac caa ate gcg gcc aag gtt get aag cag tag 
leu val ala ile gly ile ala asn gin ile ala ala lys val ala lys gin AMB 

SEQ ID NO.20D 



FIGURE 20D 



REPLACEMENT SHEET (RULE 26) 



77/185 



ORF according to Cole et al . (Nature 393:537-544) and containing Rv3576 



1/1 

taa get tgt cgc aca tgg tgc 
OCH ala cys arg thr trp cys 
61/21 

gcc gcg ctg gtc ggt gcg tgc 
ala ala leu val gly ala cys 
121/41 

gcc gtg get gcc gac aaa tec 
ala val ala ala asp lys ser 
181/61 

gaa ggg ctg ctt etc gae ttg 
glu gly leu leu leu asp leu 
241/81 

gtg tgg ttc aac gcc aag gca 
val trp phe asn ala lys ala 
301/101 

ctg get ate gac ggt cca gca 
leu ala ile asp gly pro ala 
361/121 

cgc ggc eaa egg ctg gat gae 
arg gly gin arg leu asp asp 
421/141 

caa gcg gtc gtc ggc ttc ccg 
gin ala val val gly phe pro 
481/161 

caa age tgg age age tgc teg 
gin ser trp ser ser cys ser 
541/181 

gcc gcc tgg act gtg get gae 
ala ala trp thr val ala asp 
601/201 

cag gaa ggc ggc gac gga tgg 
gin glu gly gly asp gly trp 
661/221 

ate gac att gtc acg tgc gcc 
ile asp ile val thr cys ala 
721/241 

caa ate gcg gcc aag gtt get 
gin ile ala ala lys val ala 



31/11 

egg cag gga gga aca gtg 
arg gin gly gly thr val 
91/31 

atg etc gca gee gga tgc 
met leu ala ala gly cys 

151/51 
gga cca ctg cat cag gat 
gly pro leu his gin asp 

211/71 
age eag ate aat gcc gcg 
ser gin ile asn ala ala 

271/91 
atg tgg gac tgg age aag 
met trp asp trp ser lys 

331/111 
cag gaa aag gtc tat gcc 
gin glu lys val tyr ala 

391/131 
age ate gat gae tec aag 
ser ile asp asp ser lys 

451/151 
ace gca cat gat gcc gag 
thr ala his asp ala glu 

511/171 
aac cgc egg ttt gtc gaa 
asn arg arg phe val glu 

571/191 
gtt gtc aac gac aac ggc 
val val asn asp asn gly 

631/211 
ace tgc cag cgt gcc ctg 
thr cys gin arg ala leu 

691/231 
tat age caa ccg gat ttg 
tyr ser gin pro asp leu 

aag eag tag 
lys gin AMB 



ggc aag cag eta gcc gcg etc 
gly lys gin leu ala ala leu 

ace aac gtg gtc gae ggg ace 
thr asn val val asp gly thr 

ccg ata ccg gtt tea gcg ctt 
pro ile pro val ser ala leu 

ctg ggt gcg aca teg atg aag 
leu gly ala thr ser met lys 

age gtg gcc gac aag aat tgc 
ser val ala asp lys asn cys 

ggc ace ggg tgg acc get atg 
gly thr gly trp thr ala met 

aaa cgc gac cac tac gcc att 
lys arg asp his tyr ala ile 

gag ttc tac age tec teg gtg 
glu phe tyr ser ser ser val 

gtc acc cec gga eag gac gac 
val thr pro gly gin asp asp 

atg etc agt age teg cag gtt 
met leu ser ser ser gin val 

act gcg cgc aac aac gtc act 
thr ala arg asn asn val thr 

gtg gcg att ggc ate get aac 
val ala ile gly ile ala asn 



SEQ ID No, 2 OF 
FIGURE 20F 



REPLACEMENT SHEET (RULE 26) 



78/185 



1/1 



GTC CTG 


GTC 


GCC 


GCG 


CAA 


CTG 


GCC 


GGT 


CCC 


val leu 


val 


ala 


ala 


gin 


leu 


ala 


gly pro 


61/21 


















CGC TGG 


TAG 


TGG 


CGA 


TGG 


TGT 


TAG 


CAG 


GAT 


arg trp 


AMB 


trp 


arg 


trp 


cys 


AMB 


gin 


asp 


121/41 


















GCG GCC 


TGC 


GGC 


TGG 


TCG 


CCG 


CGC 


GCG 


CCG 


ala ala 


cys 


gly 


trp 


ser 


pro 


arg 


ala 


pro 


181/61 


















CGG CGC 


TGG 


ACG 


TCG 


CCG 


TGC 


TGG 


CCA 


GCT 


arg arg 


trp 


thr 


ser 


pro 


cys 


trp 


pro 


ala 


241/81 


















AAA ACT 


TCA 


CCG 


ccc 


GCA 


AGT 


ACG 


AGC 


TGC 


lys thr 


ser 


pro 


pro 


ala 


ser 


thr 


ser 


cys 


301/101 


















CAG ACG 


TGC 


GGT 


CGG 


GAG 


TGA 


ACA 


CGC 


TGC 


gin thr 


cys 


gly 


arg 


glu 


OPA 


thr 


arg 


cys 


361/121 


















TGC TGG 


CCG 


ACA 


GCA 


TCG 


GCT 


TGC 


GGG 


ATC 


cys trp 


pro 


thr 


ala 


ser 


ala 


cys 


gly 


lie 



31/11 

GAT GGA AAG TGT TCA CGA TCG CGC TTC TGC 
asp gly lys cys ser arg ser arg phe cys 
91/31 

TGC GGG TCG AGG CTG CGA TGG CCA GCA CCA 
cys gly ser arg leu arg trp pro ala pro 
151/51 

AAA TGA TAC CCG CGA TCA CGA AAT ACA TGT 
lys OPA tyr pro arg ser arg asn thr cys 
211/71 

CGA CCG GAC ACG ATG TGG AGG GGG CGC AGA 
arg pro asp thr met trp arg gly arg arg 
271/91 

AGA CGC GAC TGG CCG ACA CCG ACG TCA TCG 
arg arg asp trp pro thr pro thr ser ser 
331/111 

TCA ACG GCG GTC AGG CGC TGC TGG ATA AGA 
ser thr ala val arg arg cys trp ile arg 



SEQ ID N0.21A 
FIGURE 21A 



TCC TGG TCG CCG CGC AAC 
ser trp ser pro arg asn 
62/21 

GCT GGT AGT GGC GAT GGT 
ala gly ser gly asp gly 
122/41 

CGG CCT GCG GCT GGT CGC 
arg pro ala ala gly arg 
182/61 

GGC GCT GGA CGT CGC CGT 
gly ala gly arg arg arg 

242/81 

AAA CTT CAC CGC CCG CAA 
lys leu his arg pro gin 
302/101 

AGA CGT GCG GTC GGG AGT 
arg arg ala val gly ser 
362/121 

GCT GGC CGA CAG CAT CGG 
ala gly arg gin his arg 











32/11 




TGG 


CCG 


GTC 


CCG 


ATG GAA 


AGT 


trp 


pro 


val 


pro 


met glu 


ser 










92/31 




GTT 


AGC 


AGG 


ATT 


GCG GGT 


CGA 


val 


ser 


arg 


ile 


ala gly 


arg 










152/51 




CGC 


GCG 


CGC 


CGA 


AAT GAT 


ACC 


arg 


ala 


arg 


arg 


asn asp 


thr 










212/71 




GCT 


GGC 


CAG 


CTC 


GAC CGG 


ACA 


ala 


gly gin 


leu 


asp arg 


thr 










272/91 




GTA 


CGA 


GCT 


GCA 


GAC GCG 


ACT 


val 


arg 


ala 


ala 


asp ala 


thr 










332/111 




GAA 


CAC 


GCT 


GCT 


CAA CGG 


CGG 


glu 


his 


ala 


ala 


gin arg 


arg 


CTT 


GCG 


GGA 


TC 






leu 


ala 


gly 









SEQ ID No. 2 IB 



FIGURE 21B 



GTT CAC GAT CGC GCT TCT GCC 
val his asp arg ala ser ala 

GGC TGC GAT GGC CAG CAC CAG 
gly cys asp gly gin his gin 

CGC GAT CAC GAA ATA CAT GTC 

arg asp his glu ile his val 

CGA TGT GGA GGG GGC GCA GAA 
arg cys gly gly gly ala glu 

GGC CGA CAC CGA CGT CAT CGC 
gly arg his arg arg his arg 

TCA GGC GCT GCT GGA TAA GAT 
ser gly ala ala gly OCH asp 



REPLACEMENT SHEET (RULE 26) 



79/185 



33/11 

CGG TCC CGA TGG AAA GTG TTC ACG ATC GCG CTT CTG CCG 

arg ser arg trp lys val phe thr ile ala leu leu pro 
93/31 

GCA GGA TTG CGG GTC GAG GCT GCG ATG GCC AGC ACC AGO 

ala gly leu arg val glu ala ala met ala ser thr ser 
153/51 

CGC GCC GAA ATG ATA CCC GCG ATC ACG AAA TAC ATG TCG 

arg ala glu met ile pro ala ile thr lys tyr met ser 
213/71 

GCC AGC TCG ACC GGA CAC GAT GTG GAG GGG GCG CAG AAA 

ala ser ser thr gly his asp val glu gly ala gin lys 
273/91 

GAG CTG CAG ACG CGA CTG GCC GAC ACC GAC GTC ATC GCA 

glu leu gin thr arg leu ala asp thr asp val ile ala 
333/111 

ACG CTG CTC AAC GGC GGT GAG GCG CTG CTG GAT AAG ATG 

thr leu leu asn gly gly gin ala leu leu asp lys met 

CGG GAT C 
arg asp 

SEQ ID N0.21C 
FIGURE 21C 



part of the nucleotide sequence of seq21A 



1/1 


















31/11 


















ACG ATC 


GCG 


CTT 


CTG 


CCG 


CTG 


GTA 


GTG 


GCG 


ATG GTG 


TTA 


GCA 


GGA 


TTG 


CGG 


GTC 


GAG 


GCT 


thr ile 


ala 


leu 


leu 


pro 


leu 


val 


val 


ala 


met val 


leu 


ala 


gly 


leu 


arg 


val 


glu 


ala 


61/21 


















91/31 
















GCG ATG 


GCC 


AGC 


ACC 


AGC 


GGC 


CTG 


CGG 


CTG 


GTC GCC 


GCG 


CGC 


GCC 


GAA 


ATG 


ATA 


CCC 


GCG 


ala met 


ala 


ser 


thr 


ser 


gly 


leu 


arg 


leu 


val ala 


ala 


arg 


ala 


glu 


met 


ile 


pro 


ala 


121/41 


















151/51 
















ATC ACG 


AAA 


TAC 


ATG 


TCG 


GCG 


CTG 


GAC 


GTC 


GCC GTG 


CTG 


GCC 


AGC 


TCG 


ACC 


GGA 


CAC 


GAT 


ile thr 


lys 


tyr 


met 


ser 


ala 


leu 


asp 


val 


ala val 


leu 


ala 


ser 


ser 


thr 


gly 


his 


asp 


181/61 


















211/71 














GTG GAG 


GGG 


GCG 


CAG 


AAA 


AAC 


TTC 


ACC 


GCC 


CGC AAG 


TAC 


GAG 


CTG 


CAG 


ACG 


CGA 


CTG 


GCC 


val glu 


gly 


ala 


gin 


lys 


asn 


phe 


thr 


ala 


arg lys 


tyr 


glu 


leu 


gin 


thr 


arg 


leu 


ala 


241/81 


















271/91 


















GAC ACC 


GAC 


GTC 


ATC 


GCA 


GAC 


GTG 


CGG 


TCG 


GGA GTG 


AAC 


ACG 


CTG 


CTC 


AAC 


GGC 


GGT 


CAG 


asp thr 


asp 


val 


ile 


ala 


asp 


val 


arg 


ser 


gly val 


asn 


thr 


leu 


leu 


asn 


gly 


gly 


gin 


301/101 


















331/111 


















GCG CTG 


CTG 


GAT 


AAG 


ATG 


CTG 


GCC 


GAC 


AGC 


ATC GGC 


TTG 


CGG 


GAT 


C 










ala leu 


leu 


asp 


lys 


met 


leu 


ala 


asp 


ser 


ile gly 


leu 


arg 


asp 













SEQ ID 

FIGURE 21A' 



REPLACEMENT SHEET (RULE 26) 



CCT GGT CGC CGC GCA ACT GGC 
pro gly arg arg ala thr gly 
63/21 

CTG GTA GTG GCG ATG GTG TTA 
leu val val ala met val leu 
123/41 

GGC CTG CGG CTG GTC GCC GCG 
gly leu arg leu val ala ala 
183/61 

GCG CTG GAC GTC GCC GTG CTG 
ala leu asp val ala val leu 
243/81 

AAC TTC ACC GCC CGC AAG TAC 
asn phe thr ala arg lys tyr 
303/101 

GAC GTG CGG TCG GGA GTG AAC 
asp val arg ser gly val asn 

363/121 

CTG GCC GAC AGC ATC GGC TTG 
leu ala asp ser ile gly leu 



81/185 



sequence Rv3365c predicted 
Seq21A' 

1/1 

gtg acc atg ttc gcc cgc ccg 
val thr met phe ala arg pro 
61/21 

ccg get caa ccg gcc cgc ggc 
pro ala gin pro ala arg gly 
121/41 

tgg ccg gtc cga tgg aaa gtg 
trp pro val arg trp lys val 
181/61 

tta gca gga ttg egg gtc gag 
leu ala gly leu arg val glu 
241/81 

gcg cgc gcc gaa atg ata ccc 
ala arg ala glu met lie pro 

301/101 

ctg gcc age teg acc gga cac 
leu ala ser ser thr gly his 
361/121 

tac gag ctg cag acg cga ctg 
tyr glu leu gin thr arg leu 
421/141 

aac acg ctg etc aac ggc ggt 
asn thr leu leu asn gly gly 
481/161 

ttg egg gat egg gtc acc gee 
leu arg asp arg val thr ala 
541/181 

gcg tog gtg egg gtt gac age 
ala ser val arg val asp ser 
601/201 

gtt ggc gcc cgc ggg cag atg 
val gly ala arg gly gin met 
661/221 

gcc gag ccg caa ctg cgc age 
ala glu pro gin leu arg ser 
721/241 

ttc ggg atg age gcg gcg etc 
phe gly met ser ala ala leu 
781/261 

atg gtg acc agg atg gcg ate 
met val thr arg met ala ile 
841/281 

ctg ctg cac teg ata cag ate 
leu leu his ser ile gin ile 
901/301 

gag gcg gtg acg aag teg gtg 
glu ala val thr lys ser val 
961/321 

gac gee gtg ctg gtg ttg gcc 
asp ala val leu val leu ala 



by Cole et al . (Nature 



31/11 

acc ate ccg gtc gcg gcg 
thr ile pro val ala ala 

91/31 

aaa cct cag caa cgc ccg 
lys pro gin gin arg pro 

151/51 
ttc acg ate gcg ctt ctg 
phe thr ile ala leu leu 

211/71 
get gcg atg gcc age acc 
ala ala met ala ser thr 

271/91 
gcg ate acg aaa tac atg 
ala ile thr lys tyr met 

331/111 
gat gtg gag ggg gcg cag 
asp val glu gly ala gin 

391/131 
gee gac acc gac gtc ate 
ala asp thr asp val ile 

451/151 
cag gcg ctg ctg gat aag 
gin ala leu leu asp lys 

511/171 
tac gcg ccg ctg ctg ttg 
tyr ala pro leu leu leu 

571/191 
gag caa ate cga acc cag 
glu gin ile arg thr gin 

631/211 
acg atg cag gag ate ctg 
thr met gin glu ile leu 

691/231 
gcg atg gtt acc ctg gee 
ala met val thr leu ala 

751/251 
ggt gca ggc teg ccg gac 
gly ala gly ser pro asp 

811/271 
atg tec gat ccg gcc gtt 
met ser asp pro ala val 

871/291 
acc cgc gac att gcc gag 
thr arg asp ile ala glu 

931/311 
caa age cag gcc acc gac 
gin ser gin ala thr asp 

991/331 
gee ate gcg acc gcg ate 
ala ile ala thr ala ile 



SEQ ID N0.21F 
FIGURE 21D 



393:537-544) and containing 



gcc get tct gat att tec gcc 
ala ala ser asp ile ser ala 

ccg tec tgg teg ccg cgc aac 
pro ser trp ser pro arg asn 

ccg ctg gta gtg gcg atg gtg 
pro leu val val ala met val 

age ggc ctg egg ctg gtc gcc 
ser gly leu arg leu val ala 

teg gcg ctg gac gtc gee gtg 
ser ala leu asp val ala val 

aaa aac ttc acc gcc cgc aag 
lys asn phe thr ala arg lys 

gca gac gtg egg teg gga gtg 
ala asp val arg ser gly val 

gtg ctg gcc gac age ate ggc 
val leu ala asp ser ile gly 

acg gcc cag aac gtg att gac 
thr ala gin asn val ile asp 

gtg cag ggt ttg age cga gcc 
val gin gly leu ser arg ala 

gtg act cgc ggc gcc gac ctt 
val thr arg gly ala asp leu 

ggc ace gaa ccc teg acg ctg 
gly thr glu pro ser thr leu 

acc aag aac ctg cag cag caa 
thr lys asn leu gin gin gin 

gca ctg gtc aac aac cea gag 
ala leu val asn asn pro glu 

cag gtg ate acc gac ace ace 
gin val ile thr asp thr thr 

egg egg gat gcc gcg att cgc 
arg arg asp ala ala ile arg 

gtc gtc gtg ttg gtg gtg gcg 
val val val leu val val ala 



REPLACEMENT SHEET (RULE 26) 



86/185 



1861/621 

gtc ggg ctg cgc ggt ccg gtg acc ggt gaa 
val gly leu arg gly pro val thr gly glu 
1921/641 

ctg ccg eta gcc gtg etc gag ggg acg gcc 
leu pro leu ala val leu glu gly thr ala 
1981/661 

gcg ate aag ccg ccg tgt cet gaa ccc gcg 
ala ile lys pro pro cys pro glu pro ala 
2041/681 

ate ggg ccg eta cca ccg gtc acg ttg etc 
ile gly pro leu pro pro val thr leu leu 
2101/701 

gee gae gtc ccg gee cag ccg atg cag cag 
ala asp val pro ala gin pro met gin gin 
2161/721 

gag gat agg ttt caa cag gag ccc aaa caa 
glu asp arg phe gin gin glu pro lys gin 
2221/741 

eeg ccc gcc aaa ccc gcg cca ccg gcg ggc 
pro pro ala lys pro ala pro pro ala gly 
2281/761 

atg etc tec gag atg gtg ggt gac ccg cac 
met leu ser glu met val gly asp pro his 
2341/781 

aag teg gtg tgg gae cac ggc tgg teg gcg 
lys ser val trp asp his gly trp ser ala 
2401/801 

tec cgc acg gac tac ggc ctg ccg gtg cgc 
ser arg thr asp tyr gly leu pro val arg 
2461/821 

gcg gtg cet gag gga ccc gat egg gag cat 
ala val pro glu gly pro asp arg glu his 
2521/841 

ctt cat ccc ggc cga gcg ccg egg cac gcg 
leu his pro gly arg ala pro arg his ala 
2581/861 

gcc tec ate age age cat ttc ggc ggc gtg 
ala ser ile ser ser his phe gly gly val 
2641/881 

agt cag gga cce aat cag caa tga 
ser gin gly pro asn gin gin OPA 



1891/631 

cag ggc acc ggc acc acc gcc gag gtc tac 
gin gly thr gly thr thr ala glu val tyr 
1951/651 

cca gcg cag ccg cca aag ccg egg gta ttt 
pro ala gin pro pro lys pro arg val phe 
2011/671 

gcg gcc gat ccg acg gac gtt cce gee gee 
ala ala asp pro thr asp val pro ala ala 
2071/691 

ccg cgc cgt acc ccg ggg tec agt ggc ate 
pro arg arg thr pro gly ser ser gly ile 
2131/711 

egg egg cgc gag ctg aaa aca ccc tgg tgg 
arg arg arg glu leu lys thr pro trp trp 
2191/731 

ccg ccc gca cca gaa ccg cga ccg gcg ccg 
pro pro ala pro glu pro arg pro ala pro 
2251/751 

ccg gtt gat gac gac gtc ate tac egg egg 
pro val asp asp asp val ile tyr arg arg 
2311/771 

gag ctg gee cac age ecc gat ctg gae tgg 
glu leu ala his ser pro asp leu asp trp 
2371/791 

gcc gcc gag gcc gcg gac aag ccc gtg cag 
ala ala glu ala ala asp lys pro val gin 
2431/811 

gaa ccc ggg gcc egg tta gtg ccg ggg gcg 
glu pro gly ala arg leu val pro gly ala 
2491/831 

ccg ggt gca gcg eta gca tec aac ggc gga 
pro gly ala ala leu ala ser asn gly gly 
2551/851 

get gcg gta cgc gac ccc gac gcg gtt cgt 
ala ala val arg asp pro asp ala val arg 
2611/871 

cgc acc ggg egg teg cat gcc cgc gag age 
arg thr gly arg ser his ala arg glu ser 



SEQ ID No, 2 IF (continued 2) 
FIGURE 2 IF (continued 2) 



REPLACEMENT SHEET (RULE 26) 



# 



87/185 



CTA CGA CAA GGC AAA 
leu arg gin gly lys 
61/21 

CGC CAT TCT GGT CGC 
arg his ser gly arg 
121/41 

TGA GAG CAC GAG CGC 
OPA asp his asp arg 
181/61 

CGT CAT CGA CGG TAA 
arg his arg arg OCH 
241/81 

CAA TGT CAA CAT CGC 
gin cys gin his arg 
301/101 

CAA CCC TCC GGA GGT 
gin pro ser gly gly 
361/121 

CAC GTC GGG CAC CGG 
his val gly his arg 



GGA 


GCA 


CAG 


GGT 


GAA 


gly 


ala 


gin 


gly 


glu 


AGG 


TCT 


TTC 


CGG 


ATG 


arg 


ser 


phe 


arg 


met 


GGC 


AGG 


CAC 


GAC 


GGC 


gly 


arg 


his 


asp 


gly 


GGA 


CCA 


GAA 


CGT 


CAC 


gly pro 


glu 


arg 


his 


GAT 


CGG 


CGG 


GGC 


GGC 


asp 


arg 


arg 


gly 


gly 


GAA 


GTC 


CGT 


TGG 


GCT 


glu 


val 


arg 


trp 


ala 


ACA 


GGG 


TAA 


CGC 


TCG 


thr 


gly 


OCH 


arg 


ser 



31/11 

GCG TGG ACT GAC GGT 
ala trp thr asp gly 
91/31 

TTC T^G CAA CAA GTC 
phe lys gin gin val 
151/51 

AAG CCC CGG CGC CGC 
lys pro arg arg arg 
211/71 

CGG GTC TGT GGT GTG 
arg val cys gly val 
271/91 

GAC CGG CAT TGC CGC 
asp arg his cys arg 
331/111 

CGG TAA CGT CAA CGG 
arg OCH arg gin arg 
391/131 

GCA ACC AAG GAC GGC 
ala thr lys asp gly 



CGC GGT AGC CGG AGC 
arg gly ser arg ser 

GAC TAC AGG AAG CGG 
asp tyr arg lys arg 

ATC CGG GCC GAA GGT 
ile arg ala glu gly 

CAC AAC CGC GGC CGG 
his asn arg gly arg 

CGT GCT CAC CGA CGG 
arg ala his arg arg 

CGT CAC GCT GGG ATA 
arg his ala gly ile 

AGC CAC TAC AAG ATC 
ser his tyr lys ile 



SEQ ID No,22A 



FIGURE 22A 



TAC GAC AAG GCA AAG GAG CAC AGG GTG AAG 
tyr asp lys ala lys glu his arg val lys 
62/21 

GCC ATT CTG GTC GCA GGT CTT TCC GGA TGT 
ala ile leu val ala gly leu ser gly cys 
122/41 

GAG ACC ACG ACC GCG GCA GGC ACG ACG GCA 
glu thr thr thr ala ala gly thr thr ala 
182/61 

GTC ATC GAC GGT AAG GAC CAG AAC GTC ACC 
val ile asp gly lys asp gin asn val thr 
242/81 

AAT GTC AAC ATC GCG ATC GGC GGG GCG GCG 
asn val asn ile ala ile gly gly ala ala 
302/101 

AAC CCT CCG GAG GTG AAG TCC GTT GGG CTC 
asn pro pro glu val lys ser val gly leu 
362/121 

ACG TCG GGC ACC GGA CAG GGT AAC GCT CGG 
thr ser gly thr gly gin gly asn ala arg 



32/11 

CGT GGA CTG ACG GTC GCG GTA GCC GGA GCC 
arg gly leu thr val ala val ala gly ala 
92/31 

TCA AGC AAC AAG TCG ACT ACA GGA AGC GGT 
ser ser asn lys ser thr thr gly ser gly 
152/51 

AGC CCC GGC GCC GCA TCC GGG CCG AAG GTC 
ser pro gly ala ala ser gly pro lys val 
212/71 

GGG TCT GTG GTG TGC ACA ACC GCG GCC GGC 
gly ser val val cys thr thr ala ala gly 
272/91 

ACC GGC ATT GCC GCC GTG CTC ACC GAC GGC 
thr gly ile ala ala val leu thr asp gly 
332/111 

GGT AAC GTC AAC GGC GTC ACG CTG GGA TAC 
gly asn val asn gly val thr leu gly tyr 
392/131 

CAA CCA AGG ACG GCA GCC ACT ACA AGA TC 
gin pro arg thr ala ala thr thr arg 



SEQ ID N0.22B 



FIGURE 22B 



REPLACEMENT SHEET (RULE 26) 



88/185 



ACG ACA AGO CAA AGG AGC ACA 
thr thr arg gin arg ser thr 
63/21 

CCA TTC TOG TCG CAG GTC TTT 
pro phe trp ser gin val phe 
123/41 

AGA CCA CGA CCG CGG CAG GCA 
arg pro arg pro arg gin ala 
183/61 

TCA TCG ACG GTA AGG ACC AGA 
ser ser thr val arg thr arg 
243/81 

ATG TCA ACA TCG CGA TCG GCG 
met ser thr ser arg ser ala 
303/101 

ACC CTC CGG AGG TGA AGT CCG 
thr leu arg arg OPA ser pro 
363/121 

CGT CGG GCA CCG GAC AGG GTA 
arg arg ala pro asp arg val 



33/11 



GGG 


TGA 


T\ C C 




1 ^r\. 


gly 


OPA 


ser 


val asp 








y ^/ 0 1 




CCG 


GAT 








pro 


asp 


val 


gin ala 


tnr 








X 0 0/ ox 




CGA 


CGG 


CAA 






arg 


arg 


gin 


ala pro 


axa 








^ Xo/ /X 






1 K^lTi. 






TGG 


thr 


ser 


pro 


gly leu 


trp 








273/91 




GGG 


CGG 


CGA 


CCG GCA 


TTG 


gly 


arg 


arg 


pro ala 


leu 








333/111 




TTG 


GGC 


TCG 


GTA ACG 


TCA 


leu 


gly 


ser 


val thr 


ser 








393/131 




ACG 


CTC 


GGC 


AAC CAA 


GGA 


thr 


leu 


gly 


asn gin 


gly 



SEQ ID No, 22c 



FIGURE 22c 



CGG TCG CGG TAG CCG GAG CCG 
arg ser arg AMB pro glu pro 

AGT CGA CTA CAG GAA GCG GTG 
ser arg leu gin glu ala val 

CCG CAT CCG GGC CGA AGG TCG 
pro his pro gly arg arg ser 

TGT GCA CAA CCG CGG CCG GCA 
cys ala gin pro arg pro ala 

CCG CCG TGC TCA CCG ACG GCA 
pro pro cys ser pro thr ala 

ACG GCG TCA CGC TGG GAT ACA 
thr ala ser arg trp asp thr 

CGG CAG CCA CTA CAA GAT C 
arg gin pro leu gin asp 



GCA CAA CCG CGG CCG 
ala gin pro arg pro 
61/21 

CCG TGC TCA CCG ACG 
pro cys ser pro thr 
121/41 

GCG TCA CGC TGG GAT 
ala ser arg trp asp 
181/61 

GCA GCC ACT ACA AGA 
ala ala thr thr arg 
241/81 

CAT TCT GGT CGC AGG 
his ser gly arg arg 
301/101 

GAC CAC GAC CGC GGC 
asp his asp arg gly 
361/121 

ATC GAC GGT AAG GAC 
ile asp gly lys asp 
421/141 

GTC AAC ATC GCG ATC 
val asn ile ala ile 
481/161 

CCT CCG GAG GTG AAG 
pro pro glu val lys 
541/181 

TCG GGC ACC GGA CAG 
ser gly thr gly gin 



GCA ATG TCA ACA TCG 
ala met ser thr ser 

GCA ACC CTC CGG AGG 
ala thr leu arg arg 

ACA CGT CGG GCA CCG 
thr arg arg ala pro 

TCA CAG GGT GAA GCG 
ser gin gly glu ala 

TCT TTC CGG ATG TTC 
ser phe arg met phe 

AGG CAC GAC GGC AAG 
arg his asp gly lys 

CAG AAC GTC ACC GGC 
gin asn val thr gly 

GGC GGG GCG GCG ACC 
gly gly ala ala thr 

TCC GTT GGG CTC GGT 
ser val gly leu gly 

GGT AAC GCC TCG GCA 
gly asn ala ser ala 

SEQ ID 



31/11 

CGA TCG GCG GGG CGG 
arg ser ala gly arg 
91/31 

TGA AGT CCG TTG GGC 
OPA ser pro leu gly 
151/51 

GAC AGG GTA ACG CCT 
asp arg val thr pro 
211/71 

TGG ACT GAC GGT CGC 
trp thr asp gly arg 
271/91 

AAG CAA CAA GTC GAC 
lys gin gin val asp 
331/111 

CCC CGG CGC CGC TCC 
pro arg arg arg ser 
391/131 

TCC GTG GTG TGC ACA 

ser val val cys thr 
451/151 

GGC ATT GCC GCC GTG 
gly ile ala ala val 
511/171 

AAC GTC AAC GGC GTC 
asn val asn gly val 
571/191 

ACC AAG GAC GGC AGC 
thr lys asp gly ser 

No,23A 



CGA CCG GCA TTG CCG 
arg pro ala leu pro 

TCG GTA ACG TCA ACG 
ser val thr ser thr 

CGG CAA CCA AGG ACG 
arg gin pro arg thr 

GGT AGC CGG AGC CGC 
gly ser arg ser arg 

TAC AGG AAG CGG TGA 
tyr arg lys arg OPA 

GGG CCG AAG GTC GTC 
gly pro lys val val 

ACC GCG GCC GGC AAT 
thr ala ala gly asn 

CTC ACC GAC GGC AAC 
leu thr asp gly asn 

ACG CTG GGA TAC ACG 
thr leu gly tyr thr 

CAC TAC PJKG ATC 
his tyr lys ile 



FIGURE 23A 
REPLACEMENT SHEET (RULE 26) 



# 



89/185 



CAC AAC CGC GGC CGG 
his asn arg gly arg 
62/21 

CGT GCT CAC CGA CGG 
arg ala his arg arg 
122/41 

CGT CAC GCT GGG ATA 
arg his ala gly ile 
182/61 

CAG CCA CTA CAA GAT 
gin pro leu gin asp 
242/81 

ATT CTG GTC GCA GGT 
ile leu val ala gly 
302/101 

ACC ACG ACC GCG GCA 
thr thr thr ala ala 
362/121 

TCG ACG GTA AGG ACC 
ser thr val arg thr 
422/141 

TCA ACA TCG CGA TCG 
ser thr ser arg ser 
482/161 

CTC CGG AGG TGA AGT 
leu arg arg OPA ser 
542/181 

CGG GCA CCG GAC AGG 
arg ala pro asp arg 



CAA TGT CAA CAT CGC 
gin cys gin his arg 

CAA CCC TCC GGA GGT 
gin pro ser gly gly 

CAC GTC GGG CAC CGG 
his val gly his arg 

CAC AGG GTG AAG CGT 
his arg val lys arg 

CTT TCC GGA TGT TCA 
leu ser gly cys ser 

GGC ACG ACG GCA AGC 
gly thr thr ala ser 

AGA ACG TCA CCG GCT 
arg thr ser pro ala 

GCG GGG CGG CGA CCG 
ala gly arg arg pro 

CCG TTG GGC TCG GTA 
pro leu gly ser val 

GTA ACG CCT CGG CAA 
val thr pro arg gin 



32/11 

GAT CGG CGG GGC GGC 
asp arg arg gly gly 
92/31 

GAA GTC CGT TGG GCT 
glu val arg trp ala 
152/51 

ACA GGG TAA CGC CTC 
thr gly OCH arg leu 
212/71 

GGA CTG ACG GTC GCG 
gly leu thr val ala 
272/91 

AGC AAC AAG TCG ACT 
ser asn lys ser thr 
332/111 

CCC GGC GCC GCT CCG 
pro gly ala ala pro 
392/131 

CCG TGG TGT GCA CAA 
pro trp cys ala gin 
452/151 

GCA TTG CCG CCG TGC 
ala leu pro pro cys 
512/171 

ACG TCA ACG GCG TCA 
thr ser thr ala ser 
572/191 

CCA AGG ACG GCA GCC 
pro arg thr ala ala 



GAC CGG CAT TGC CGC 
asp arg his cys arg 

CGG TAA CGT CAA CGG 
arg OCH arg gin arg 

GGC AAC CAA GGA CGG 
gly asn gin gly arg 

GTA GCC GGA GCC GCC 
val ala gly ala ala 

ACA GGA AGC GGT GAG 
thr gly ser gly glu 

GGC CGA AGG TCG TCA 
gly arg arg ser ser 

CCG CGG CCG GCA ATG 
pro arg pro .ala met 

TCA CCG ACG GCA ACC 
ser pro thr ala thr 

CGC TGG GAT ACA CGT 
arg trp asp thr arg 

ACT ACA AGA TC 
thr thr arg 



SEQ ID No,23B 
FIGURE 23B 



REPLACEMENT SHEET (RULE 26) 



Ill 

90/185 



33/11 





ACA ACC 












AAC 


ATC 


GCG 


ATC GGC 


GGG 


GCG 


GCG 


ACC 


GGC 


ATT 


GCC 


GCC 




thr thr 


aia 


aia 


gly 


asn 


vax 


asn 


ixe 




ile gly 


y j.y 


aia 


aia 


thr 


crlv 


ile 


aia 


aia 




63/21 
















93/31 




















GTG CiL. 


ACC 








CCT 


CCG 


GAG 


GTG 


AAG TCC 


GTT 


GGG 


CTC 


GGT 


AAC 


GTC 


AAC 


GGC 




val leu 


unr 


asp 


gly 


asn 


p ro 


pro 


glu 


val 


lys ser 


val 


crl v 


leu 


crlv 


asn 


val 


asn 


crlv 




123/41 














153/51 




















GTC ACG 


CTG 


GGA 








GGC 


ACC 


GGA 


CAG GGT 


AAC 


GCC 


TCG 


GCA 


ACC 


AAG 


GAC 


GGC 




val thr 


leu 


gly 


tyr 


unr 


ser 




thr 




crln alv 


asn 


aia 


ser 


aia 


thr 


lys 


asp 


qiv 

^ J; 




183/61 
















213/71 




















AGC CAC 


TAC 


AAG 


ATC 


ACA 


GGG 


TGA 


AGC 


GTG 


GAC TGA 


CGG 


TCG 


CGG 


TAG 


CCG 


CAG 


CCG 


CCA 




ser his 


tyr 


j_ys 


ire 




gly 


OPA 


ser 


val 


asp OPA 


arg 


ser 


ara 


AMB 


pro 


glu 


pro 


pro 




243/81 














273/91 




















mm m<"/~' 


TCG 




J. ^ 


rpmrp 

1 J. i. 


CCG 


GAT 


GTT 


CAA 


GCA ACA 


AGT 


CGA 


CTA 


CAG 


GAA 


GCG 


GTG 


AGA 




phe trp 


ser 


gin 


vax 


phe 


pro 


asp 


val 


gin 


aia thr 


ser 


ara 


leu 


gin 


qlu 


aia 


val 


arg 


"J 


303/101 


















333/111 




















CCA CGA 


CCG 


CGG 


CAG 


GCA 


CGA 


CGG 


CAA 


GCC 


CCG GCG 


CCG 


CTC 


CGG 


GCC 


GAA 


GGT 


CGT 


CAT 




pro arg 


pro 


arg 


gin 


aia 


arg 


arg 


gin 


aia 


pro aia 


pro 


leu 


arg 


aia 


glu 


gly 


arg 


his 




363/121 
















393/131 




















CGA CGG 


TAA 


GGA 


CCA 


GAA 


CGT 


CAC 


CGG 


CTC 


CGT GGT 


GTG 


CAC 


AAC 


CGC 


GGC 


CGG 


CAA 


TGT 




arg arg 


OCH 


gly 


pro 


glu 


arg 


his 


arg 


leu 


arg gly 


val 


his 


asn 


arg 


gly 


arg 


gin 


cys 




423/141 
















453/151 




















CAA CAT 


CGC 


GAT 


CGG 


CGG 


GGC 


GGC 


GAC 


CGG 


CAT TGC 


CGC 


CGT 


GCT 


CAC 


CGA 


CGG 


CAA 


CCC 




gin his 
483/161 


arg 


asp 


arg 


arg 


gly 


gly 


asp 


arg 


his cys 
513/171 


arg 


arg 


aia 


his 


arg 


arg 


gin 


pro 




TCC GGA 


GGT 


GAA 


GTC 


CGT 


TGG 


GCT 


CGG 


TAA 


CGT CAA 


CGG 


CGT 


CAC 


GCT 


GGG 


ATA 


CAC 


GTC 




ser gly 


gly 


glu 


val 


arg 


trp 


aia 


arg 


OCH 


arg gin 


arg 


arg 


his 


aia 


gly 


ile 


his 


val 




543/181 
















573/191 




















GGG CAC 


CGG 


ACA 


GGG 


TAA 


CGC 


CTC 


GGC 


AAC 


CAA GGA 


CGG 


CAG 


CCA 


CTA 


CAA 


GAT 


C 






gly his 


arg 


thr 


gly 


OCH 


arg 


leu 


gly 


asn 


gin gly 


arg 


gin 


pro 


leu 


gin 


asp 







SEQ ID N0.23C 
FIGURE 23C 



REPLACEMENT SHEET (RULE 26) 




91/185 



CTA ACG ACA GGC AAA GGA GCA 
leu thr thr gly lys gly ala 
61/21 

CGC CAT TCT GGT CGC AGG TCT 
arg his ser gly arg arg ser 
121/41 

TGA GAC CAC GAC CGC GGC AGG 
OPA asp his asp arg gly arg 
181/61 

GTC ATC GAC GGT AAG GAC CAG 
val ile asp gly lys asp gin 
241/81 

AAT GTC AAC ATC GCG ATC GGC 
asn val asn ile ala ile gly 
301/101 

AAC CCT CCG GAG GTG AAG TCC 
asn pro pro glu val lys ser 
361/121 

ACG TCG GGC ACC GGA CAG GGT 
thr ser gly thr gly gin gly 



31/11 



CAG 


GGT 


GAA 


GCG TGG 


ACT 


gin 


gly 


glu 


ala trp 


thr 








91/31 




TTC 


CGG 


ATG 


TTC AAG 


CAA 


phe 


arg 


met 


phe lys 


gin 






151/51 




CAC 


GAC 


GGC 


AAG CCC 


CGG 


his 


asp 


gly 


lys pro 


arg 








211/71 




AAC 


GTC 


ACC 


GGC TCC 


GTG 


asn 


val 


thr 


gly ser 


val 








271/91 




GGG 


GCG 


GCG 


ACC GGC 


ATT 


gly 


ala 


ala 


thr gly 


ile 








331/111 




GTT 


GGG 


CTC 


GGT AAC 


GTC 


val 


gly 


leu 


gly asn 


val 








391/131 




AAC 


GCC 


TCG 


GCA ACC 


AAG 


asn 


ala 


ser 


ala thr 


lys 



SEQ ID N0.24A 
FIGURE 24A 



GAC GGT CGC GGT AGC CGG AGC 
asp gly arg gly ser arg ser 

CAA GTC GAC TAC AGG AAG CGG 
gin val asp tyr arg lys arg 

CGC CGC TCC GGG CCG AAG GTC 
arg arg ser gly pro lys val 

GTG TGC ACA ACC GCG GCC GGC 
val cys thr thr ala ala gly 

GCC GCC GTG CTC ACC GAC GGC 
ala ala val leu thr asp gly 

AAC GGC GTC ACG CTG GGA TAC 
asn gly val thr leu gly tyr 

GAC GGC AGC CAC TAC AAG ATC 
asp gly ser his tyr lys ile 



TAA CGA CAG GCA AAG GAG CAC 
OCH arg gin ala lys glu his 
62/21 

GCC ATT CTG GTC GCA GGT CTT 
ala ile leu val ala gly leu 
122/41 

GAG ACC ACG ACC GCG GCA GGC 
glu thr thr thr ala ala gly 

182/61 

TCA TCG ACG GTA AGG ACC AGA 
ser ser thr val arg thr arg 
242/81 

ATG TCA ACA TCG CGA TCG GCG 
met ser thr ser arg ser ala 

302/101 

ACC CTC CGG AGG TGA AGT CCG 
thr leu arg arg OPA ser pro 
362/121 

CGT CGG GCA CCG GAC AGG GTA 
arg arg ala pro asp arg val 









32/11 




AGG 


GTG 


AAG 


CGT GGA 


CTG 


arg 


val 


lys 


arg gly 


leu 








92/31 




TCC 


GGA 


TGT 


TCA AGC 


AAC 


ser 


gly 


cys 


ser ser 


asn 








152/51 




ACG 


ACG 


GCA 


AGC CCC 


GGC 


thr 


thr 


ala 


ser pro 


gly 








212/71 




ACG 


TCA 


CCG 


GCT CCG 


TGG 


thr 


ser 


pro 


ala pro 


trp 








272/91 




GGG 


CGG 


CGA 


CCG GCA 


TTG 


gly 


arg 


arg 


pro ala 


leu 








332/111 




TTG 


GGC 


TCG 


GTA ACG 


TCA 


leu 


gly 


ser 


val thr 


ser 








392/131 




ACG 


CCT 


CGG 


CAA CCA 


AGG 


thr 


pro 


arg 


gin pro 


arg 



SEQ ID N0.24B 
FIGURE 24B 



ACG GTC GCG GTA GCC GGA GCC 
thr val ala val ala gly ala 

AAG TCG ACT ACA GGA AGC GGT 
lys ser thr thr qly ser gly 

GCC GCT CCG GGC CGA AGG TCG 
ala ala pro gly arg arg ser 

TGT GCA CAA CCG CGG CCG GCA 
cys ala gin pro arg pro ala 

CCG CCG TGC TCA CCG ACG GCA 
pro pro cys ser pro thr ala 

ACG GCG TCA CGC TGG GAT ACA 
thr ala ser arg trp asp thr 

ACG GCA GCC ACT ACA AGA TC 
thr ala ala thr thr arg 



REPLACEMENT SHEET (RULE 26) 




92/185 



33/11 



AAC GAC 


AGG 


CAA 


AGG 


AGC 


ACA 


GGG 


1 GrA 


7<r*r* r'^r* Cht^ 


TGA 


CGG 


1 




TAG 


CCG 


GAG 


CCG 


asn asp 


arg 


gin 


arg 


ser 


tnr 


gly 


UrA 


ser vaj_ dop 


OPA 


arg 


ser 


arg 


AMB 


pro 


glu 


pro 


63/21 














Zf -D / ^ ± 


















CCA TTC 


TGG 


TCG 


CAG 


GTC 


TTT 


CCG 


<^ TV rp 


f^mrp (-*7\Ti. finHk 


ACA 


AGT 


CGA 


CTA 


CAG 


GAA 


GCG 


GTG 


pro phe 


trp 


ser 


gin 


va _L 


phe 


p ro 


asp 




thr 


ser 


arg 


leu 


gin 


glu 


ala 


val 


123/41 
















-1 c: o / c -1 


















AGA CCA 


CGA 


CCG 


CGG 


CAG 






r*r*c 


nazi. f^rc* ccfz 


GCG 


CCG 


CTC 


CGG 


GCC 


GAA 


GGT 


CGT 




arg 


pro 


arg 


aln 


ala 


arq 


arg 


gin ala pro 


ala 


pro 


leu 


arg 


ala 


glu 


gly 


arg 


183/61 
















213/71 


















CAT CGA 


CGG 


TAA 


GGA 


CCA 


GAA 


CGT 


CAC 


CGG CTC CGT 


GGT 


GTG 


CAC 


AAC 


CGC 


GGC 


CGG 


CAA 


his arg 


arg 


OCH 


gly 


pro 


glu 


arg 


his 


arg leu arg 


gly val 


his 


asn 


arg 


gly 


arg 


gin 


243/81 














273/91 


















TGT CAA 


CAT 


CGC 


GAT 


CGG 


CGG 


GGC 


GGC 


GAC CGG CAT 


TGC 


CGC 


CGT 


GCT 


CAC 


CGA 


CGG 


CAA 


cys gin 


his 


arg 


asp 


arg 


arg 


gly 


qly 


asp arg his 


cys 


arg 


arg 


ala 


his 


arg 


arg 


gin 


303/101 
















333/111 


















CCC TCC 


GGA 


GGT 


GAA 


GTC 


CGT 


TGG 


GCT 


CGG TAA CGT 


CAA 


CGG 


CGT 


CAC 


GCT 


GGG 


ATA 


CAC 


pro ser 


gly 


gly 


glu 


val 


arg 


trp 


ala 


arg OCH arg 


gin 


arg 


arg 


his 


ala 


gly 


ile 


his 


363/121 














393/131 


















GTC GGG 


CAC 


CGG 


ACA 


GGG 


TAA 


CGC 


CTC 


GGC AAC CAA 


GGA 


CGG 


CAG 


CCA 


CTA 


CAA 


GAT 


C 


val gly 


his 


arg 


thr 


gly 


OCH 


arg 


leu 


gly asn gin 


gly 


arg 


gin 


pro 


leu 


gin 


^sp 





SEQ ID N0.24C 
FIGURE 24C 



Direct priiaer 
5' ACG CGG CGC AGC CTG TTG 3' 



SEQ ID No. 25 
FIGURE 25 



Reverse primer 
5' CGA CCT TGG GAT TCG CCT 3' 



SEQ ID No. 26 
FIGURE 26 



REPLACEMENT SHEET (RULE 26) 



93/185 



CCT ACC AGC AAG AGC CCA GGG 
pro thr ser lys ser pro gly 
61/21 

TCC AAT TTT CCT TCC GCC CCG 
ser asn phe pro ser ala pro 
121/41 

CGG TCG CGC AGC TTG CGG AAA 
arg ser arg ser leu arg lys 
181/61 

GTC GTC GOT CTC CAC CCG TCG 
val val ala leu his pro ser 
241/81 

TCG GCT GAG TGT CTG CAG GGC 
ser ala glu cys leu gin gly 
301/101 

AAG TCC CCC CCG CGC GAC TTC 
lys ser pro pro arg asp phe 
361/121 

CTC ATT TCG CAG CAA GCG GTC 
leu ile ser gin gin ala val 
421/141 

GTC GGA CGA GTC GTC GTC AAC 
val gly arg val val val asn 









31/11 




CTT 


CAC 


AGG 


ACC TAA 


AAG 


leu 


his 


arg 


thr OCH 


lys 








91/31 




TGC 


AAT 


ACC 


ATC TGC 


AAG 


cys 


asn 


thr 


ile cys 


lys 








151/51 




CGG 


GGT 


ATG 


GAC CCT 


GCC 


arg 


gly 


met 


asp pro 


ala 








211/71 




GGG 


GGC 


GAA 


AGC CAT 


TCC 


Cfl V 

J. jr 


qIv 


glu 


ser his 


ser 








271/91 




TCC 


GGG 


GAG 


CAG CCG 


ATC 


ser 


gly 


glu 


gin pro 


ile 








331/111 




CAG 


ACA 


TTT 


GTT GTG 


GTT 


gin 


thr 


phe 


val val 


val 








391/131 




TCC 


GGG 


TCG 


CAG CAT 


CGT 


ser 


gly 


ser 


gin his 


arg 


GAC 


CAC 


GAT 


C 




asp 


his 


asp 







SEQ ID No,27A 
FIGURE 27A 



GAG TAG CGC CCA TGG GCT TGA 
glu AMB arg pro trp ala OPA 

ACC AGC GAC GGC CCG TGG TTG 
thr ser asp gly pro trp leu 

GTA CCG TTG TTG CCA CTT GAT 
val pro leu leu pro leu asp 

GAC ACT GGG ATC CTC AAA ACG 
asp thr gly ile leu lys thr 

ATC ACC ATG TAC GAA CTG AAT 
ile thr met tyr glu leu asn 

TCG GTT GAG GCC GAG GCG AGG 
ser val glu ala glu ala arg 

TGC GGC GAT CGC GGC GCA GTC 
cys gly asp arg gly ala val 



CTA CCA GCA AGA GCC 
leu pro ala arg ala 
61/21 

CCA ATT TTC CTT CCG 
pro ile phe leu pro 
121/41 

GGT CGC GCA GCT TGC 
gly arg ala ala cys 
181/61 

TCG TCG CTC TCC ACC 
ser ser leu ser thr 
241/81 

CGG CTG AGT GTC TGC 
arg leu ser val cys 
301/101 

AGT CCC CCC CGC GCG 
ser pro pro arg ala 
361/121 

TCA TTT CGC AGC AAG 
ser phe arg ser lys 
421/141 

TCG GAC GAG TCG TCG 
ser asp glu ser ser 



CAG 


GGC 


TTC 


ACA 


GGA 


gin 


gly phe 


thr 


gly 


CCC 


CGT 


GCA 


ATA 


CCA 


pro 


arg 


ala 


ile 


pro 


GGA 


AAC 


GGG 


GTA 


TGG 


gly 


asn 


gly 


val 


trp 


CGT 


CGG 


GGG 


GCG 


AAA 


arg 


arg 


gly 


ala 


lys 


AGG 


GCT 


CCG 


GGG 


AGC 


arg 


ala 


pro 


gly 


ser 


ACT 


TCC 


AGA 


CAT 


TTG 


thr 


ser 


arg 


his 


leu 


CGG 


TCT 


CCG 


GGT 


CGC 


arg 


ser 


pro 


gly 


arg 


TCA 


ACG 


ACC 


ACG 


ATC 


ser 


thr 


thr 


thr 


ile 



31/11 

CCT AAA AGG AGT AGC 
pro lys arg ser ser 
91/31 

TCT GCA AGA CCA GCG 
ser ala arg pro ala 
151/51 

ACC CTG CCG TAC CGT 
thr leu pro tyr arg 
211/71 

GCC ATT CCG ACA CTG 
ala ile pro thr leu 
271/91 

AGC CGA TCA TCA CCA 
ser arg ser ser pro 
331/111 

TTG TGG TTT CGG TTG 
leu trp phe arg leu 
391/131 

AGC ATC GTT GCG GCG 
ser ile val ala ala 



GCC CAT GGG CTT GAT 
ala his gly leu asp 

ACG GCC CGT GGT TGC 
thr ala arg gly cys 

TGT TGC CAC TTG ATG 
cys cys his leu met 

GGA TCC TCA AAA CGT 
gly ser ser lys arg 

TGT ACG AAC TGA ATA 
cys thr asn OPA ile 

AGG CCG AGG CGA GGC 
arg pro arg arg gly 

ATC GCG GCG CAG TCG 
ile ala ala gin ser 



SEQ ID N0.27B 



FIGURE 27B 
REPLACEMENT SHEET (RULE 26) 




94/185 



33/11 



TAG CAG 


CAA 


GAG 


CCC 


AGG 


GCT 


TCA 


CAG 


GAC 


/— < m TV 7\ TV 7\ 

CTA AAA 


GGA 


GTA 


GCG 




Ai \J 




mm f 
i 1 G 




tyr gin 


gin 


glu 


pro 


arg 


ala 


ser 


gin 


asp 


leu lys 


gly 


val 


aia 


pro 


met 


gly 


leu 


ixe 


63/21 


















93/31 


















CAA TTT 


TCC 


TTC 


CGC 


CCC 


GTG 


CAA 


TAC 


CAT 


CTG CAA 


GAC 


CAG 


CGA 


CGG 


CCC 


GTG 


/-•mrri 
Vji i 




gin phe 


ser 


phe 


arg 


pro 


val 


gin 


tyr 


his 


leu gin 


asp 


gin 


arg 


arg 


pro 


vax 


vax 


axa 


123/41 


















153/51 


















GTC GCG 


CAG 


CTT 


GCG 


GAA 


ACG 


GGG 


TAT 


GGA 


CCC TGC 


CGT 


ACC 


GTT 


GTT 






i G/V 


rp/^rp 

i tji 


val ala 


gin 


leu 


ala 


glu 


thr 


gly 


tyr 


gly 


pro cys 


arg 


thr 


vai 


vax 


a±a 


tnr 




cys 


183/61 
















213/7 1 


















CGT CGC 


TOT 


CCA 


CCC 


GTC 


GGG 










CGA 


CAC 


TGG 


GAT 


CCT 


CAA 


AAC 


GTC 


arg arg 


ser 


pro 


pro 


val 


gly 


gly 


arg 


lys 


pro p^e 


a r g 


hi s 


trp 


asp 


pro 


gin 


as n 


val 


243/81 


































GGC 1 GA 


Gi O 


i i 








CGG 


GGA 


GCA 


GCC GAT 


CAT 


CAC 


CAT 


GTA 


CGA 


ACT 


GAA 


TAA 


gly OPA 


val 


ser 


ala 


gly 


leu 


arg 


gly 


ala 


ala asp 


his 


his 


his 


val 


arg 


thr 


glu 


OCH 


303/101 


















333/111 


















GTC CCC 


CCC 


GCG 


CGA 


CTT 


CCA 


GAC 


ATT 


TGT 


TGT GGT 


TTC 


GGT 


TGA 


GGC 


CGA 


GGC 


GAG 


GCT 


val pro 


pro 


ala 


arg 


leu 


pro 


asp 


lie 


cys 


cys gly 


phe 


gly 


OPA 


gly 


arg 


gly 


glu 


ala 


363/121 
















393/131 


















CAT TTC 


GCA 


GCA 


AGC 


GGT 


CTC 


CGG 


GTC 


GCA 


GCA TCG 


TTG 


CGG 


CGA 


TCG 


CGG 


CGC 


AGT 


CGT 


his phe 


ala 


ala 


ser 


gly 


leu 


arg 


val 


ala 


ala ser 


leu 


arg 


arg 


ser 


arg 


arg 


ser 


arg 


423/141 




































CGG ACG 


AGT 


CGT 


CGT 


CAA 


CGA 


CCA 


CGA 


TC 




















arg thr 


ser 


arg 


arg 


gin 


arg 


pro 


arg 























SEQ ID N0.27C 
FIGURE 27C 



MKTGTATTRRRLIAVLIALALPGAAVALLAEPSATGASDPCAASEVAR 
TVGSVAKSMGDYLDSHPETNQVMTAVLQQQVGPGSVASLKAHFEANPK 
VAS DLHALSQPLTDLS T RC S L P I S GLQAI GLMQAVQGARR 



SEQ ID No*28 
FIGURE 28 



GTGGGCAAGC AGCTAGCCGC GCTCGCCGCG CTGGTCGGTG CGTGCATGCT CGCAGCCGGA 
TGCACCAACG TGGTCGACGG GACCGCCGTG GCTGCCGACA AATCCGGACC ACTGCATCAG 
GATCCGATAC CGGTTTCAGC GCTTGAAGGG CTGCTTCTCG ACTTGAGCCA GATCAATGCC 
GCGCTGGGTG CGACATCGAT GAAGGTGTGG TTCAACGCCA AGGCAATGTG GGACTGGAGC 
AAGAGCGTGG CCGACAAGAA TTGCCTGGCT ATCGACGGTC CAGCACAGGA AAAGGTCTAT 
GCCGGCACCG GGTGGACCGC TATGCGCGGC CAACGGCTGG ATGACAGCAT CGATGACTCC 
AAGAAACGCG ACCACTACGC CATTCAAGCG GTCGTCGGCT TCCCGACCGC ACATGATGCC 
GAGGAGTTCT ACAGCTCCTC GGTGCAT^AGC TGGAGCAGCT GCTCGAACCG CCGGTTTGTC 
GAAGTCACCC CCGGACAGGA CGACGCCGCC TGGACTGTGG CTGACGTTGT CAACGACAAC 
GGCATGCTCA GTAGCTCGCA GGTTCAGGAA GGCGGCGACG GATGGACCTG CCAGCGTGCC 
CTGACTGCGC GCAACAACGT CACTATCGAC ATTGTCACGT GCGCCTATAG CCAACCGGAT 
TTGGTGGCGA TTGGCATCGC TAACCAAATC GCGGCCAAGG TTGCTAAGCA GTAG 



SEQ ID No. 29 

FIGURE 29 
REPLACEMENT SHEET (RULE 26) 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
714 



95/185 



MGKQIJ^AIJ^VGACMIJiAGCTNWDGTAVAADKSGPLHQDPIPVFTSALEGLLLDLSQ^ 
MKVWFNAKAMWDWS KS VADKNCLAI DGPAQEKVYAGTGFTWTAMRGQR^ 

GFPTAHDAEEFYSSSVQSWSSCSNRRFVEVTFTPGQDDAAWTVADWNDNGMLSSSQVQEGGDGWTCQ 
RALTARNNVTI DI VTCAYSQPDLVFTAI GIANQIAAKVAKQ 



SEQ ID No. 30 
FIGURE 30 



1/1 31/11 



J./ X 


ATA 


CCC 


GCG 


AGG 


GCA 


GCG 


CGA 


CGG 


CGG CCC 


TGC 


CGG 


CGC 


CGT 


GGC 


TGC 


TGA 


ACA 


a,x.y cLxy 


X X C 


pro 


ala 


arg 


ala 


ala 


arg 


arg 


arg pro 


cys 


arg 


arg 


arg 


gly 


cys 


OPA 


thr 


















91/31 




















CCC 


AGC 


CGC 


GCA 


CGC 


TTC 


CGG 


TAT 


GCG GCA 


GGA 


TAA 


ACG 


ACC 


CCA 


ACA 


GCA 


CGA 




p JTO 


s er 


arg 


ala 


arg 


phe 


arg 


tyr 


ala ala 


gly 


OCH 


thr 


thr 


pro 


thr 


ala 


arg 


1 o 1 //II 
±Z X / ft X 
















151/51 


















ACA CCA 


GGA 


TTG 


CGA 


CAA 


CCA 


AAG 


CCC 


TCG 


CGC CTG 


GCT 


CGA 


TTT 


CGC 


GCG 


CAA 


CGC 


GGC 


thr pro 


gly 


leu 


arg 


gin 


pro 


lys 


pro 


ser 


arg leu 


ala 


arg 


phe 


arg 


ala 


gin 


arg 


gly 


181/61 
















211/71 


















GTT CTG 


CCG 


CCT 


CGA 


TCT 


CAG 


CGC 


GGA 


GGG 


CGT CGA 


GAT 


C 














val leu 


pro 


pro 


arg 


ser 


gin 


arg 


gly 


gly 


arg arg 


asp 






























SEQ ID 


N0.31A 






























FIGURE 31A 


















1/1 


















31/11 


















GGC GAA 


TAG 


CCG 


CGA 


GGG 


CAG 


CGC 


GAC 


GGC 


GGC CCT 


GCC 


GGC 


GCC 


GTG 


GCT 


GCT 


GAA 


CAA 


gly glu 


tyr 


pro 


arg 


gly 


gin 


arg 


asp 


gly 


gly pro 


ala 


gly 


ala 


val 


ala 


ala 


glu 


gin 


61/21 


















91/31 


















CAC ATC 


CCA 


GCC 


GCG 


CAC 


GCT 


TCC 


GGT 


ATG 


CGG CAG 


GAT 


T^iAA 


CGA 


CCC 


CAA 


CAG 


CAC 


GAA 


his ile 


pro 


ala 


ala 


his 


ala 


ser 


gly 


met 


arg gin 


asp 


lys 


arg 


pro 


gin 


gin 


his 


glu 


121/41 
















151/51 


















CAC CAG 


GAT 


TGC 


GAG 


AAC 


CAA 


AGC 


CCT 


CGC 


GCC TGG 


CTC 


GAT 


TTC 


GCG 


CGC 


AAC 


GCG 


GCG 


his gin 


asp 


cys 


asp 


asn 


gin 


ser 


pro 


arg 


ala trp 


leu 


asp 


phe 


ala 


arg 


asn 


ala 


ala 


181/61 


















211/71 


















TTC TGC 


CGC 


CTC 


GAT 


CTC 


AGC 


GCG 


GAG 


GGC 


GTC GAG 


ATC 
















phe cys 


arg 


leu 


asp 


leu 


ser 


ala 


glu 


gly 


val glu 


ile 

















SEQ ID N0.31B 
FIGURE 31B 



REPLACEMENT SHEET (RULE 26) 



1/1 


















GCG AAT 


ACC 


CGC 


GAG 


GGC 


AGC 


GCG 


ACG 


GCG 


ala asn 


thr 


arg 


glu 


gly 


ser 


ala 


thr 


ala 


61/21 


















ACA TCC 


CAG 


CCG 


CGC 


ACG 


CTT 


CCG 


GTA 


TGC 


thr ser 


gin 


pro 


arg 


thr 


leu 


pro 


val 


cys 


121/41 


















ACC AGG 


ATT 


GCG 


ACA ACC 


AAA 


GCC 


CTC 


GCG 


thr arg 


ile 


ala 


thr 


thr 


lys 


ala 


leu 


ala 


181/61 


















TCT GCC 


GCC 


TCG 


ATC 


TCA 


GCG 


CGG 


AGG 


GCG 


ser ala 


ala 


ser 


ile 


ser 


ala 


arg 


arg 


ala 



96/185 

31/11 

GCC CTG CCG GCG CCG TGG CTG CTG AAC AAC 
ala leu pro ala pro trp leu leu asn asn 
91/31 

GGC AGG ATA AAC GAC CCC AAC AGC ACG AAC 
gly arg ile asn asp pro asn ser thr asn 
151/51 

CCT GGC TCG ATT TCG CGC GCA ACG CGG CGT 
pro gly ser ile ser arg ala thr arg arg 
211/71 
TCG AGA. TC 
ser arg 



SEQ ID N0.31C 



FIGURE 31C 



ORF according to Cole et al . (Nature 393:537-544) and containing seqSlA 
1/1 31/11 

taa acg acc cca aca gca cga aca cca gga ttg cga caa cca aag ccc teg cgc ctg get 
OCH thr thr pro thr ala arg thr pro gly leu arg gin pro lys pro ser arg leu ala 
61/21 91/31 

cga ttt cgc gcg caa cgc ggc gtt ctg ccg cct cga tct cag cgc gga ggg cgt cga gat - 
arg phe arg ala gin arg gly val leu pro pro arg ser gin arg gly gly arg arg asp 
121/41 151/51 

ccc egg cgt cgt gtt cgt ggc tea tea tct gca tec tec ggg ctt ggc cgc get gac egg 
pro arg arg arg val arg gly ser ser ser ala ser ser gly leu gly arg ala asp arg 
181/61 211/71 

cag ccc gac ccc agg cat gee cag gee gac ggc gcg ccc egg ctg ccc ggc ggt gtg cgc 
gin pro asp pro arg his ala gin ala asp gly ala pro arg leu pro gly gly val arg 
241/81 271/91 

gtc gcc ggc gcg ggt gcg gcg gtg ggt cag gac gcc ggc gtc ggc gat gag gtg gtg egg 
val ala gly ala gly ala ala val gly gin asp ala gly val gly asp glu val val arg 
301/101 331/111 

cgc cgc ttc ggt gac ctt cgt ggt gat gac gtc gcc ggg acg cac gcg egg ctg gcc ggc 
arg arg phe gly asp leu arg gly asp asp val ala gly thr his ala arg leu ala gly 
361/121 391/131 

ggt gaa gtg cac cag gcg ccc gtc gcg cgc ccg ccc get cat gcg cgc cgt gac ggt gtc 
gly glu val his gin ala pro val ala arg pro pro ala his ala arg arg asp gly val 
421/141 451/151 

ctt gcg ccc ttc ccc ggt ggc cac cag cac etc gac ggc ctg ccc gac cag ggc gcg gtt 
leu ala pro phe pro gly gly his gin his leu asp gly leu pro asp gin gly ala val 
481/161 511/171 

ggc ttc cag cga gat ttg etc ctg cag cgc gat cag gcg ttc ata gcg ttc ctg cac aac 
aly phe gin arg asp leu leu leu gin arg asp gin ala phe ile ala phe leu his asn 
541/181 571/191 

ggc ttt egg cag ctg tec gtc gag ttg cgc ggc egg tgt ccc ggg ccg ctt gga gta ttg 
gly phe arg gin leu ser val glu leu arg gly arg cys pro gly pro leu gly val leu 
601/201 631/211 

gaa ggt aaa tgc ggc cgc gaa gcg ggc ccg gcg cac cac gtc gag cgt ggc cgc gaa gtc 
glu gly lys cys gly arg glu ala gly pro ala his his val glu arg gly arg glu val 
661/221 691/231 

etc ttc ggt etc ccc ggg gaa acc gac gat cag ate ggt ggt aat cgc ggc atg egg gat 
leu phe gly leu pro gly glu thr asp asp gin ile gly gly asn arg gly met arg asp 
721/241 751/251 

ggc cgc ccg cac gcg etc gat gat gcc gag gta gcg etc ggc acg ata gga ccg ccg eat 
gly arg pro his ala leu asp asp ala glu val ala leu gly thr ile gly pro pro his 
781/261 811/271 
cgc gcg cag gat ccg gtc gga tec gga ctg tag 
arg ala gin asp pro val gly ser gly leu AMB 



SEQ ID No,31F 



FIGURE 31F 



REPLACEMENT SHEET (RULE 26) 



97/185 



1/1 

aga ctg gtg tac acg 
arg leu val tyr thr 
61/21 

eta gtg aag gtc att 
leu val lys val ile 
121/41 

ttc gtc ggt ttc gcg 
phe val gly phe ala 
181/61 

ccg ttg egg gac tac 
pro leu arg asp tyr 
241/81 

ggg ccg acc gcg ctg 
gly pro thr ala leu 

301/101 

tac ggg cgc ate aaa 
tyr gly arg ile lys 



gag acc aag ctg aac 
glu thr lys leu asn 

cag aaa ctg teg ggc 
gin lys leu ser gly 

egg atg gtc gag gee 
arg met val glu ala 

gaa ctg ggc acg gtg 
glu leu gly thr val 

aac tat gtg cgc get 
asn tyr val arg ala 

cgc cag cag ttg ttt 
arg gin gin leu phe 



31/11 

teg gea ttc tec ttc 
ser ala phe ser phe 
91/31 

ttg age ate aac egg 
leu ser ile asn arg 
151/51 

etc ggc ggc gtc gag 
leu gly gly val glu 
211/71 

ctg gag cae gee gga 
leu glu his ala gly 
271/91 

cgc cag gtc acc acc 
arg gin val thr thr 
331/111 

ttg teg teg ctg ctg 
leu ser ser leu leu 



ggc ggg ect aag tgt 
gly gly pro lys cys 

ttc ate gcg att gac 
phe ile ala ile asp 

gta tgc age ace ace 

val cys ser thr thr 

cgc cag gtc att gac 
arg gin val ile asp 

gag age aat ggc gac 
glu ser asn gly asp 

cgt teg atg ate 
arg ser met ile 



SEQ ID N0.32A 
FIGURE 32A 



1/1 

gac tgg tgt aca 
asp trp cys thr 
61/21 

tag tga agg tea 
AMB OPA arg ser 
121/41 

teg teg gtt teg 
ser ser val ser 
181/61 

cgt tgc ggg act 
arg cys gly thr 
241/81 

ggc ega ccg cgc 
gly arg pro arg 
301/101 

acg gge gea tea 
thr gly ala ser 



egg aga cca age 
arg arg pro ser 

ttc aga aae tgt 
phe arg asn cys 

cgc gga tgg teg 
arg gly trp ser 

acg aac tgg gea 
thr asn trp ala 

tga act atg tgc 
OPA thr met cys 

aac gee age agt 
asn ala ser ser 



31/11 
tga act egg cat 
OPA thr arg his 

91/31 
egg get tga gea 
arg ala OPA ala 

151/51 
agg ccc teg gcg 
arg pro ser ala 

211/71 
egg tgc tgg age 
arg cys trp ser 

271/91 
gcg etc gee agg 
ala leu ala arg 

331/111 
tgt ttt tgt cgt 
cys phe cys arg 



tet ect teg gcg 
ser pro ser ala 

tea ace ggt tea 
ser thr gly ser 

gcg teg agg tat 
ala ser arg tyr 

acg ccg gac gee 
thr pro asp ala 

tea cca ccg aga 
ser pro pro arg 

cgc tgc tgc gtt 
arg cys cys val 



gge eta agt gtc 
gly leu ser val 

teg ega ttg act 
ser arg leu thr 

gea gea cca ccc 
ala ala pro pro 

agg tea ttg acg 
arg ser leu thr 

gea atg gcg act 
ala met ala thr 

ega tga te 
arg OPA 



SEQ ID N0.32B 



FIGURE 32B 



REPLACEMENT SHEET (RULE 26) 



il 



98/185 



1/1 31/11 

act ggt gta cac gga gac caa get gaa etc ggc att etc ctt egg egg gcc taa gtg tct 
thr gly val his gly asp gin ala giu leu gly ile leu leu arg arg aia OCH val ser 
61/21 91/31 

agt gaa ggt cat tea gaa act gtc ggg ctt gag cat caa ceg gtt cat cgc gat tga ctt 
ser giu gly his ser giu thr val gly leu giu his gin pro val his arg asp OPA leu 
121/41 151/51 

cgt egg ttt cgc gcg gat ggt cga ggc cct egg egg cgt cga ggt atg cag cac cac ccc 
arg arg phe arg ala asp giy arg gly pro arg arg arg arg gly met gin his his pro 
181/61 211/71 

gtt gcg gga eta cga act ggg cac ggt get gga gca cgc egg acg cea ggt cat tga egg 
val ala giy leu arg thr gly his gly ala gly ala arg arg thr pro giy his OPA arg 
241/81 271/91 

gee gae cgc get gaa eta tgt gcg cgc teg cca ggt cac cac cga gag caa tgg cga eta 
ala asp arg ala giu leu cys ala arg ser pro gly his his arg giu gin trp arg leu 
301/101 331/111 

egg gcg cat caa acg cca gca gtt gtt ttt gtc gte get get gcg tte gat gat c 
arg aia his gin thr pro ala vai val phe val val ala ala ala phe asp asp 

SEQ ID No ,320 
FIGURE 32C 



sequence Rv0822c predicted by Cole et al. (Nature 393:537-5 44) and containing seq 32A 
1/1 31/11 

atg agt gac ggc gag age gcc gcg ceg tgg gca egg etc tee gag tea gca tte ccc gat 
Met ser asp giy giu ser ala ala pro trp aia arg leu ser giu ser aia phe pro asp 
61/21 91/31 

ggt gtt gac cga tgg ate acg gta ceg ccc gcc aca tgg gtg gca gee cag ggt ceg egg 
gly val asp arg trp iie thr vai pro pro ala thr trp val ala ala gin gly pro arg 
121/41 151/51 

gae ace cag aat gtc ggc tgt cat gee aec ggc gee gtt agt gtg gcc gat ctg ate gcc 
asp thr gin asn vai gly cys his ala thr gly aia vai ser val ala asp leu lie ala 
181/61 211/71 

agg etc ggc ccc get ttt ect gae etc ccc acg cac cgc eat gtc gee ccc gaa ccc gag 
arg leu gly pro ala phe pro asp leu pro thr his arg his vai ala pro giu pro giu 
241/81 271/91 

cca tec ggc cgc ggc ceg aag gtc cac gae gac gcc gac gac cag cag gae ace gag get 
pro ser gly arg gly pro lys val his asp asp ala asp asp gin gin asp thr giu ala 
301/101 331/111 

ate gee ate ceg gee cac teg etc gag tte etc teg gag ctt ccc gae etc egg gca gcc 
ile ala ile pro ala his ser leu giu phe leu ser giu leu pro asp leu arg ala ala 
361/121 391/131 

aac tat ceg cgc gee gac cac gee cgc cgt gaa ccc gag eta ccc ggc aag cag ,cta ace 
asn tyr pro arg ala asp his ala arg arg giu pro giu leu pro gly lys gin leu thr 
421/141 451/151 

gga teg get cga gtg egg cca ttg egg ate cgc cga acg teg ccc gcg ccc gcc aag cca 
gly ser aia arg val arg pro leu arg iie arg arg thr ser pro ala pro ala lys pro 
481/161 511/171 

gcg ceg aac tee ggc egg cgc ceg atg gtg ctg gcc gcg cgc teg ctg gcg get ctg ttt 
ala pro asn ser gly arg arg pro met val leu ala ala arg ser leu ala ala leu phe 
541/181 571/191 

gcc get ctg gcg ttg gcg ctg aec ggc ggg gca tgg cag tgg age gcg teg aag aac age 
aia ala leu ala leu ala leu thr giy gly ala trp gin trp ser ala ser lys asn ser 
601/201 631/211 

egg ctg aac atg gta age gcg etc gac ceg cat teg ggc gac ate gtc aac ccc age ggg 
arg leu asn met val ser aia leu asp pro his ser gly asp iie val asn pro ser giy 

SEQ ID N0.32D 

FIGURE 32D 
REPLACEMENT SHEET (RULE 26) 



99/185 



661/221 

cag cat ggc gac gag aac ttc ttg etc gtc 
gin his gly asp glu asn phe leu leu val 
721/241 

aat ate ggc gcc ggc gac gcc gag gac gcc 
asn ile gly ala gly asp ala glu asp ala 
781/261 

gtc aac att ccg gcc age cgc gag egg gtc 
val asn ile pro ala ser arg glu arg val 
841/281 

ate act cca ate caa tgc gag gcg tgg aac 
ile thr pro ile gin cys glu ala trp asn 
901/301 

gac gag aag acg gga acg atg ggt ccc aga 
asp glu lys thr gly thr met gly pro arg 
961/321 

gca ttc tee ttc ggc ggg cct aag tgt eta 
ala phe ser phe gly gly pro lys cys leu 
1021/341 

age ate aac egg ttc ate gcg att gac ttc 
ser ile asn arg phe ile ala ile asp phe 
1081/361 

ggc ggc gtc gag gta tgc age ace ace ccg 
gly gly val glu val cys ser thr thr pro 
1141/381 

gag eac gcc gga cgc cag gtc att gac ggg 
glu his ala gly arg gin val ile asp gly 
1201/401 

cag gtc ace acc gag age aat ggc gac tac 
gin val thr thr glu ser asn gly asp tyr 
1261/421 

teg teg ctg etg cgt teg atg ate teg acg 
ser ser leu leu arg ser met ile ser thr 
1321/441 

aac gtc gtc aac atg ttc ate ggt aac age 
asn val val asn met phe ile gly asn ser 
1381/461 

gtc gaa etc ggt cga teg ttg cag eat atg 
val glu leu gly arg ser leu gin his met 
1441/481 

ccg ace ggt ata acc gac cag aac ggc gac 
pro thr gly ile thr asp gin asn gly asp 
1501/501 

ctt ttc acc gcc ate ate gac gac gat ccg 
leu phe thr ala ile ile asp asp asp pro 
1561/521 

cgt ctg ggc aac acg ccg teg ace ccg ccg 
arg leu gly asn thr pro ser thr pro pro 
1621/541 

etg ace aac gag att cag cac cag cag gtt 
leu thr asn glu ile gin his gin gin val 
1681/561 

cag gtc tct aac teg ace ggc cag gcc ggt 
gin val ser asn ser thr gly gin ala gly 
1741/581 

egg aac ggc ttc aac gtg atg get ccg gac 
arg asn gly phe asn val met ala pro asp 
1801/601 

gtg ttt ttt teg ccc ggc aac gaa cag get 
val phe phe ser pro gly asn glu gin ala 
1861/621 

tea aag ate gag egg gtg ace ggg ate ggc 
ser lys ile glu arg val thr gly ile gly 



691/231 

ggt atg gac tct cgt gcc ggg gcg aac gcc 
gly met asp ser arg ala gly ala asn ala 
751/251 

ggc ggc gca cgt teg gac acc gtc atg ctg 
gly gly ala arg ser asp thr val met leu 
811/271 

gtc gcg gtg teg ttc ccc cgc gac etg gcg 
val ala val ser phe pro arg asp leu ala 
871/291 

ccc gag acc ggt aag tac gga ccc ate tac 
pro glu thr gly lys tyr gly pro ile tyr 
931/311 

ctg gtg tac acg gag ace aag ctg aac teg 
leu val tyr thr glu thr lys leu asn ser 
991/331 

gtg aag gtc att cag aaa ctg teg ggc ttg 
val lys val ile gin lys leu ser gly leu 
1051/351 

gtc ggt ttc gcg egg atg gtc gag gcc etc 
val gly phe ala arg met val glu ala leu 
1111/371 

ttg egg gac tac gaa etg ggc acg gtg ctg 
leu arg asp tyr glu leu gly thr val leu 
1171/391 

ccg ace gcg etg aac tat gtg cgc get cgc 
pro thr ala leu asn tyr val arg ala arg 
1231/411 

ggg cgc ate aaa cgc cag cag ttg ttt ttg 
gly arg ile lys arg gin gin leu phe leu 
1291/431 

gac ace ttg ttc aac etc age agg etc aac 
asp thr leu phe asn leu ser arg leu asn 
1351/451 

tac gtg gac aac gtc aag ace aaa gac ctg 
tyr val asp asn val lys thr lys asp leu 
1411/471 

gcg gee ggg eac gtc acg ttc gtg acc gtt 
ala ala gly his val thr phe val thr val 
1471/491 

gag ccc ccg cgt ace tec gac atg aag gcg 
glu pro pro arg thr ser asp met lys ala 
1531/511 

ctg ccc ctg gaa aac gat eac aac gcc cag 
leu pro leu glu asn asp his asn ala gin 
1591/531 

ace acc ace aag aag gcg ccg cag gcg ggt 
thr thr thr lys lys ala pro gin ala gly 
1651/551 

acg acg acc teg cca aaa gag gtc aca gtg 
thr thr thr ser pro lys glu val thr val 
1711/571 

ttg gcc acc acc gcc acc gat cag etc aag 
leu ala thr thr ala thr asp gin leu lys 
1771/591 

gac tac ccg agt teg ctg ctg gcc acc aca 
asp tyr pro ser ser leu leu ala thr thr 
1831/611 

gcc gcc acc gtg gcc gcc gtg ttc ggc cag 
ala ala thr val ala ala val phe gly gin 
1891/631 

caa ctg gtc cag gtg gtg ctg ggc caa gac 
gin leu val gin val val leu gly gin asp 



SEQ ID No,32D (continued 1) 
FIGURE 32D (continued 1) 



REPLACEMENT SHEET (RULE 26) 



100/185 



1921/641 

ttc age gcg gtg cgc 
phe ser ala val arg 
1981/661 

aac tec tec age cca 
asn ser ser ser pro 
2041/681 

acc acc tgc gag tag 
thr thr cys glu AMB 



get ccc ctg ccg agt 
ala pro leu pro ser 

ccg acc aag ctg ccc 
pro thr lys leu pro 



1951/651 

ggc tec ace gtc age 
gly ser thr val ser 
2011/671 

gag gac ctg acg gtc 
glu asp leu thr val 



gtg cag ata age cgc 
val gin ile ser arg 

acc aac gee gee gac 
thr asn ala ala asp 



SEQ ID N0.32D (continued 2) 
FIGURE 32D (continued 2) 



ORF according to Cole et al, (Nature 393:537-544) and containing Rv0822c 
1/1 31/11 

tag gac atg agt gac ggc gag age gcc gcg ccg tgg gca egg etc tec gag tea gca ttc 
AMB asp met ser asp gly glu ser ala ala pro trp ala arg leu ser glu ser ala phe 
61/21 91/31 

ccc gat ggt gtt gac cga tgg ate acg gta ccg ccc gcc aca tgg gtg gca gee cag ggt 
pro asp gly val asp arg trp ile thr val pro pro ala thr trp val ala ala gin gly 
121/41 151/51 

ccg egg gac acc cag aat gtc ggc tgt cat gcc acc ggc gcc gtt agt gtg gcc gat ctg 
pro arg asp thr gin asn val gly cys his ala thr gly ala val ser val ala asp leu 
181/61 211/71 

ate gee agg etc ggc ccc get ttt ect gac etc ccc acg cac cgc cat gtc gcc ccc gaa 
ile ala arg leu gly pro ala phe pro asp leu pro thr his arg his val ala pro glu 
241/81 271/91 

ccc gag cca tee ggc cgc ggc ccg aag gtc cac gac gac gcc gac gac cag cag gac acc 
pro glu pro ser gly arg gly pro lys val his asp asp ala asp asp gin gin asp thr 
301/101 331/111 

gag get ate gee ate ccg gcc cac teg etc gag ttc etc teg gag ctt ccc gac etc egg 
glu ala ile ala ile pro ala his ser leu glu phe leu ser glu leu pro asp leu arg 
361/121 391/131 

gca gcc aac tat ccg cgc gcc gac cac gcc cgc cgt gaa ccc gag eta ccc ggc aag cag 
ala ala asn tyr pro arg ala asp his ala arg arg glu pro glu leu pro gly lys gin 
421/141 451/151 

eta acc gga teg get cga gtg egg cca ttg egg ate cgc cga acg teg ccc gcg ccc gcc 
leu thr gly ser ala arg val arg pro leu arg ile arg arg thr ser pro ala pro ala 
481/161 511/171 

aag cca gcg ccg aac tec ggc egg cgc ccg atg gtg ctg gcc gcg cgc teg ctg gcg get 
lys pro ala pro asn ser gly arg arg pro met val leu ala ala arg ser leu ala ala 
541/181 571/191 

ctg ttt gcc get ctg gcg ttg gcg ctg ace ggc ggg gca tgg cag tgg age gcg teg aag 
leu phe ala ala leu ala leu ala leu thr gly gly ala trp gin trp ser ala ser lys 
601/201 631/211 

aac age egg ctg aac atg gta age gcg etc gac ccg cat teg ggc gac ate gtc aac ccc 
asn ser arg leu asn met val ser ala leu asp pro his ser gly asp ile val asn pro 
661/221 691/231 

age ggg cag eat ggc gac gag aac ttc ttg etc gtc ggt atg gac tct cgt gcc ggg gcg 
ser gly gin his gly asp glu asn phe leu leu val gly met asp ser arg ala gly ala 
721/241 751/251 

aac gcc aat ate ggc gcc ggc gac gcc gag gac gee ggc ggc gca cgt teg gac acc gtc 
asn ala asn ile gly ala gly asp ala glu asp ala gly gly ala arg ser asp thr val 
781/261 811/271 

atg ctg gtc aac att ccg gcc age cgc gag egg gtc gtc gcg gtg teg ttc ccc cgc gac 
met leu val asn ile pro ala ser arg glu arg val val ala val ser phe pro arg asp 



SEQ ID No,32F 



FIGURE 32F 
REPLACEMENT SHEET (RULE 26) 



101/185 



841/281 871/291 



ctg gcg ate 


act 


cca 


ate 


caa 


tgc 


gag 


gcg 


tgg aac ccc 


gag 


acc 


ggt 


aag 


tac 


gga 


ccc 


leu ala ile 


thr 


pro 


ile 


gin 


cys 


glu 


ala 


trp asn pro 


glu 


thr 


gly 


lys 


tyr 


gly pro 


901/301 
















931/311 
















ate tac gac 


gag 


aag 


acg 


gga 


acg 


atg 


ggt 


ccc aga ctg 


gtg 


tac 


acg 


gag 


acc 


aag 


ctg 


ile tyr asp 


glu 


lys 


thr 


gly 


thr 


met 


gly pro arg leu 


val 


tyr 


thr 


glu 


thr 


lys 


leu 


961/321 
















991/331 
















aac teg gca 


ttc 


tec 


ttc 


ggc 


ggg 


cct 


aag 


tgt eta gtg 


aag 


gte 


att 


cag 


aaa 


ctg 


teg 


asn ser ala 


phe 


ser 


phe 


gly 


gly pro 


lys 


cys leu val 


lys 


val 


ile 


gin 


lys 


leu 


ser 


1021/341 
















1051/351 
















ggc ttg age 


ate 


aac 


egg 


ttc 


ate 


gcg 


att 


gac ttc gte 


ggt 


ttc 


gcg 


egg 


atg 


gte 


gsg 


gly leu ser 


ile 


asn 


arg phe 


ile 


ala 


ile 


asp phe val 


gly 


phe 


ala 


arg 


met 


val 


glu 


1081/361 
















1111/371 
















gcc etc ggc 


ggc 


gte 


gag 


gta 


tgc 


age 


acc 


acc ccg ttg 


egg 


gac 


tac 


gaa 


ctg 


ggc 


acg 


ala leu gly 


gly 


val 


glu val 


cys 


ser 


thr 


thr pro leu 


arg 


asp 


tyr 


glu 


leu 


gly 


thr 


1141/381 
















1171/391 
















gtg ctg gag 


cac 


gcc 


gga 


cgc 


cag 


gte 


att 


gac ggg ccg 


acc 


gcg 


ctg 


aac 


tat 


gtg 


cgc 


val leu glu 


his 


ala 


gly arg 


gin val 


ile 


asp gly pro 


thr 


ala 


leu 


asn 


tyr 


val 


arg 


1201/401 
















1231/411 
















get cgc cag 


gte 


acc 


acc 


gag 


age 


aat 


ggc 


gac tac ggg 


cgc 


ate 


aaa 


cgc 


cag 


cag 


ttg 


ala arg gin 


val 


thr 


thr 


glu 


ser 


asn 


gly asp tyr gly arg 


ile 


lys 


arg 


gin 


gin 


leu 


1261/421 
















1291/431 
















ttt ttg teg 


teg 


ctg 


ctg 


cgt 


teg 


atg 


ate 


teg acg gac 


acc 


ttg 


ttc 


aac 


etc 


age 


agg 


phe leu ser 


ser 


leu 


leu 


arg 


ser 


met 


ile 


ser thr asp 


thr 


leu 


phe 


asn 


leu 


ser 


arg 


1321/441 
















1351/451 
















etc aac aac 


gte 


gte 


aac 


atg 


ttc 


ate 


ggt 


aac age tac 


gtg 


gac 


aac 


gte 


aag 


acc 


aaa 


leu asn asn 


val 


val 


asn met phe 


ile 


gly 


asn ser tyr 


val 


asp 


asn 


val 


lys 


thr 


lys 


1381/461 
















1411/471 
















gac ctg gte 


gaa 


etc 


ggt 


cga 


teg 


ttg 


cag 


cat atg gcg 


gcc 


ggg 


cac 


gte 


acg 


ttc 


gtg 


asp leu val 


glu 


leu 


gly 


arg 


ser 


leu 


gin his met ala ala 


gly 


his 


val 


thr 


phe val 


1441/481 
















1471/491 
















acc gtt ccg 


acc 




ata 


acc 


gac 


cag 


aac 


ggc gac gag 


ccc 


ccg 


cgt 


acc 


tec 


gac 


atg 


thr val pro 


thr 




ile 


thr 


asp 


gin 


asn 


gly asp glu pro 


pro 


arg 


thr 


ser 


asp 


met 


1501/501 
















1531/511 
















aag gcg ctt 


ttc 


acc 


gcc 


ate 


ate 


gac 


gac 


gat ccg ctg 


ccc 


ctg 


gaa 


aac 


gat 


cac 


aac 


iys ala leu 


phe 


thr 


ala 


ile 


ile 


asp 


asp 


asp pro leu 


pro 


leu 


glu 


asn 


asp 


his 


asn 


1561/521 
















1591/531 
















gcc cag cgt 


ctg 


ggc 


aac 


acg 


ccg 


teg 


acc 


ccg ccg acc 


acc 


acc 


aag 


aag 


gcg 


ccg 


cag 


ala gin arg 


leu 


gly 


asn 


thr 


pro 


ser 


thr 


pro pro thr 


thr 


thr 


lys 


lys 


ala 


pro 


gin 


1621/541 
















1651/551 
















gcg ggt ctg 


acc 


aac 


gag 


att 


cag 


cac 


cag 


cag gtt acg 


acg 


acc 


teg 


cca 


aaa 


gag 


gte 


ala gly leu 


thr 


asn 


glu 


ile 


gin 


his 


gin 


gin val thr 


thr 


thr 


ser 


pro 


lys 


glu val 


1681/561 
















1711/571 
















aca gtg cag 


gte 


tct 


aac 


teg 


acc 


ggc 


cag 


gee ggt ttg 


gcc 


acc 


acc 


gcc 


acc 


gat 


cag 


thr val gin 


val 


ser 


asn 


ser 


thr 


gly 


gin 


ala gly leu 


ala 


thr 


thr 


ala 


thr 


asp 


gin 


1741/581 
















1771/591 
















etc aag egg 


aac 


ggc 


ttc 


aac 


gtg 


atg 


get 


ccg gac gac 


tac 


ccg 


agt 


teg 


ctg 


ctg 


gcc 


leu lys arg 


asn 


gly 


phe 


asn 


val 


met 


ala 


pro asp asp 


tyr 


pro 


ser 


ser 


leu 


leu 


ala 


1801/601 
















1831/611 
















ace aca gtg 


ttt 


ttt 


teg 


ccc 


ggc 


aac 


gaa 


cag get gee 


gee 


acc 


gtg 


gcc 


gcc 


gtg 


ttc 


thr thr val 


phe 


phe 


ser 


pro 


gly 


asn 


glu 


gin ala ala 


ala 


thr 


val 


ala 


ala 


val 


phe 


1861/621 
















1891/631 
















ggc cag tea 


aag 


ate 


gag 


egg 


gtg 


acc 


ggg 


ate ggc caa 


ctg 


gte 


cag 


gtg 


gtg 


ctg 


ggc 


gly gin ser 


iys 


ile 


glu arg val 


thr 


gly 


ile gly gin 


leu 


val 


gin 


val 


val 


leu 


gly 


1921/641 
















1951/651 
















caa gac ttc 


age 


gcg 


gtg 


cgc 


get 


ccc 


ctg 


ccg agt ggc 


tec 


acc 


gte 


age 


gtg 


cag 


ata 


gin asp phe 


ser 


ala 


val 


arg 


ala 


pro 


leu 


pro ser gly 


ser 


thr 


val 


ser 


val 


gin 


ile 


1981/661 
















2011/671 
















age cgc aac 


tee 


tec 


age 


cca 


ccg 


ace 


aag 


ctg ccc gag 


gac 


ctg 


acg 


gte 


acc 


aac 


gcc 


ser arg asn 


ser 


ser 


ser 


pro 


pro 


thr 


lys 


leu pro glu asp 


leu 


thr 


val 


thr 


asn 


ala 



2041/681 

gcc gac acc acc tgc gag tag 
ala asp thr thr cys glu AMB 



SEQ ID 32 F {continued 1) 
FIGURE 32F (continued 1) 



REPLACEMENT SHEET (RULE 26) 



102/185 



1/1 31/11 



CGT CAC 


CTC 


TGC 


CAT 


GGT 


CCA 


TCT 


ACG 


GTA 


TCT GCG 


ACA 


AGG 


GCA 


GCG 


TCG 


ATC 


CCT 


CGA 


arg his 


leu 


cys 


his 


gly 


pro 


ser 


thr 


val 


ser ala 


thr 


arg 


ala 


ala 


ser 


ile 


pro 


arg 




















91/31 


















CAT GCA 


GAG 


TCG 


GTG 


TTC 


GCT 


TCA 


CGC 


GAA 


CTA GGC 


GCG 


CCT 


AGC 


CTG 


GAC 


GAG 


TCC 


CCG 


his ala 


glu 


ser 


val 


phe 


ala 


ser 


arg 


glu 


leu gly 


ala 


pro 


ser 


leu 


asp 


glu 


ser 


pro 


121/41 


















151/51 


















GGC CGA 


CAT 


TCG 


CCC 


GAG 


GCC 


TTG 


GCC 


TCC 


ATC ACC 


TAA 


TTG 


TGT 


GCA 


AAA 


CCG 


TAT 


CTA 


gly arg 


his 


ser 


pro 


glu 


ala 


leu 


ala 


ser 


ile thr 


OCH 


leu 


cys 


ala 


lys 


pro 


tyr 


leu 


181/61 


















211/71 


















ATT GAT 


ACG 


ATT 


GCG 


CAC 


ATG 


GCT 


ATC 


TGG 


GAT C 


















ile asp 


thr 


ile 


ala 


nis 


met 


ala 


ile 


trp 


asp 
































SEQ ID 


No . 33A 






























FIGURE 33A 


















1/1 


















31/11 


















GTC ACC 


TCT 


GCC 


ATG 


GTC 


CAT 


CTA 


CGG 


TAT 


CTG CGA 


CAA 


GGG 


CAG 


CGT 


CGA 


TCC 


CTC 


GAC 


val thr 


ser 


ala 


met 


val 


his 


leu 


arg 


tyr 


leu arg 


gin 


gly 


gin 


arg 


arg 


ser 


leu 


asp 


61/21 


















91/31 


















ATG CAG 


AGT 


CGG 


TGT 


TCG 


CTT 


CAC 


GCG 


AAC 


TAG GCG 


CGC 


CTA 


GCC 


TGG 


ACG 


AGT 


CCC 


CGG 


met gin 


ser 


arg 


cys 


ser 


leu 


his 


ala 


asn 


AMB ala 


arg 


leu 


ala 


trp 


thr 


ser 


pro 


arg 


121/41 


















151/51 


















GCC GAC 


ATT 


CGC 


CCG 


AGG 


CCT 


TGG 


CCT 


CCA 


TCA CCT 


AAT 


TGT 


GTG 


CAA 


AAC 


CGT 


ATC 


TAA 


ala asp 


ile 


arg 


pro 


arg 


pro 


trp 


pro 


pro 


ser pro 


asn 


cys 


val 


gin 


asn 


arg 


ile 


OCH 


181/61 


















211/71 


















i i r\xx\ 






CCC 




TGG 


CTA 


TCT 


GGG 


ATC 


















leu, ile 


arg 


leu 


arg 


thr 


trp 


leu 


ser 


gly 


ile 
































SEQ ID 


No . 33B 






























FIGURE 33B 


















1/1 


















31/11 


















CCG TCA 


CCT 


CTG 


CCA 


TGG 


TCC 


ATC 


TAC 


GGT 


ATC TGC 


GAC 


AAG 


GGC 


AGC 


GTC 


GAT 


CCC 


TCG 


pro ser 


pro 


leu 


pro 


trp 


ser 


ile 


tyr 


gly 


ile cys 


asp 


lys 


gly 


ser 


val 


asp 


pro 


ser 


61/21 


















91/31 


















ACA TGC 


AGA 


GTC 


GGT 


GTT 


CGC 


TTC 


ACG 


CGA 


ACT AGG 


CGC 


GCC 


TAG 


CCT 


GGA 


CGA 


GTC 


CCC 


thr cys 


arg 


val 


gly val 


arg 


phe 


thr 


arg 


thr arg 


arg 


ala 


AMB 


pro 


gly 


arg 


val 


pro 


121/41 


















151/51 


















GGG CCG 


ACA 


TTC 


GCC 


CGA 


GGC 


CTT 


GGC 


CTC 


CAT CAC 


CTA 


ATT 


GTG 


TGC 


AAA 


ACC 


GTA 


TCT 


gly pro 


thr 


phe 


ala 


arg 


gly 


leu 


gly 


leu 


his his 


leu 


ile 


val 


cys 


lys 


thr 


val 


ser 


181/61 


















211/71 


















AAT TGA 


TAC 


GAT 


TGC 


GCA 


CAT 


GGC 


TAT 


CTG 


GGA TC 


















asn CPA 


tyr 


asp 


cys 


ala 


his 


gly 


tyr 


leu 


gly 



















SEQ ID N0.33C 
FIGURE 33C 

REPLACEMENT SHEET (RULE 1€) 



103/185 



sequence Rvl044 predicted 
seq33A 

1/1 

ttg tgt gca aaa ccg tat eta 
leu cys ala lys pro tyr leu 
61/21 

gtc gag gtt gcc gcc gag caa 
val glu val ala ala glu gin 
121/41 

gtc gac cct gtg cag etc cgc 
val asp pro val gin leu arg 
181/61 

ggt gtg tac egg gtg cec gtg 
gly val tyr arg val pro val 
241/81 

teg tgg act ttg ggg cgt ggc 
ser trp thr leu gly arg gly 
301/101 

get gac gtg aac ccg teg cgc 
ala asp val asn pro ser arg 
361/121 

gcc ggg ggc gag etg tac ega 
ala gly gly glu leu tyr arg 
421/141 

gtc gac gga ata cec gtc acg 
val asp gly He pro val thr 
481/161 

acg gat cct tat cag ett egg 
thr asp pro tyr gin leu arg 
541/181 

cgt ggg tea gea get gag eta 
arg gly ser ala ala glu leu 
601/201 

egg ccg aag ega gca teg geg 
arg pro lys arg ala ser ala 



by Cole et al . {Nature 



31/11 

att gat acg att geg eac 
ile asp thr ile ala his 
91/31 

cat ggc tac gtc acg act 
his gly tyr val thr thr 
151/51 

etc eta geg ggg cgc gga 
leu leu ala gly arg gly 

211/71 
etg ccg egt ggt gag cae 
leu pro arg gly glu his 

271/91 
gtt ate teg eat gag teg 
val ile ser his glu ser 

331/111 
ate cat etc acc gtc ccg 
ile his leu thr val pro 

391/131 
gtt cae cgc cgc gac etc 
val his arg arg asp leu 

451/151 
acg gtt geg cgc ace ate 
thr val ala arg thr ile 

511/171 
gcc geg ate gag ega gcc 
ala ala ile glu arg ala 

571/191 
cgc get geg etc gat gag 
arg ala ala leu asp glu 

tga 
OPA 



393:537-544) and containing 



atg get ate tgg gat cgc etc 
met ala ile trp asp arg leu 

cgc gat geg ega gac ate ggc 
arg asp ala arg asp ile gly 

cgt ett gag egt gtc ggc ega 
arg leu glu arg val gly arg 

gac gat etc gca gee gca gtg 
asp asp leu ala ala ala val 

gcc ttg geg ett cat gcc etc 
ala leu ala leu his ala leu 

cgc aac aac cat ccg cgt geg 
arg asn asn his pro *arg ala 

cag gca gcc eac gtc act teg 
gin ala ala his val thr ser 

aaa gac tgc gtg aag acg ggc 
lys asp cys val lys thr gly 

gaa gee gag ggc acg ett cgt 
glu ala glu gly thr leu arg 

acc act gee gga tta cgc get 
thr thr ala gly leu arg ala 



SEQ ID N0.33D 
FIGURE 33D 



REPLACEMENT SHEET (RULE 26) 



104/185 



ORF according to Cole et al . (Nature 393:537-544) and containing Rvl044 
1/1 31/11 

taa ttg tgt gca aaa ccg tat eta att gat acg att gcg cac atg get ate tgg gat cgc 
OCH leu cys ala lys pro tyr leu ile asp thr ile ala his met ala ile trp asp arg 
61/21 91/31 

etc gtc gag gtt gcc gcc gag caa cat ggc tac gtc acg act cgc gat gcg cga gac ate 
leu val glu val ala ala glu gin his gly tyr val thr thr arg asp ala arg asp lie 
121/41 151/51 

ggc gtc gac cct gtg cag etc cgc etc eta gcg ggg cgc gga cgt ctt gag egt gtc ggc 
alv val asp pro val gin leu arg leu leu ala gly arg gly arg leu glu arg val gly 
181/61 211/71 

cga ggt gtg tac egg gtg cce gtg etg ccg cgt ggt gag eac gac gat etc gca gcc gca 
arg gly val tyr arg val pro val leu pro arg gly glu his asp asp leu ala ala ala 
241/81 271/91 

gtg teg tgg act ttg ggg cgt ggc gtt ate teg eat gag teg gee ttg gcg ctt cat gcc 
val ser trp thr leu gly arg gly val ile ser his glu ser ala leu ala leu his ala 
301/101 331/111 

etc get gac gtg aac ccg teg cgc ate eat etc ace gtc ccg cgc aac aac cat ccg cgt 
leu ala asp val asn pro ser arg ile his leu thr val pro arg asn asn his pro arg 
361/121 391/131 

gcg gcc ggg ggc gag ctg tac cga gtt cac cgc cgc gac etc cag gca gcc cac gtc act 
ala ala gly gly glu leu tyr arg val his arg arg asp leu gin ala ala his val thr 
421/141 451/151 

teg gtc gac gga ata cce gtc acg acg gtt gcg cgc ace ate aaa gac tgc gtg aag acg 
ser val asp gly ile pro val thr thr val ala arg thr ile lys asp cys val lys thr 
481/161 511/171 

ggc acg gat cct tat cag ctt egg gcc gcg ate gag cga gcc gaa gcc gag ggc acg ctt 
aly thr asp pro tyr gin leu arg ala ala ile glu arg ala glu ala glu gly thr leu 
541/181 571/191 

cgt cgt ggg tea gca get gag eta cgc get gcg etc gat gag ace act gee gga tta cgc 
arg arg gly ser ala ala glu leu arg ala ala leu asp glu thr thr ala gly leu arg 
601/201 

get egg ccg aag cga gca teg gcg tga 
ala arg pro lys arg ala ser ala OPA 

SEQ ID No,33F 
FIGURE 33F 



1/1 

ATC CAA CCT GCT GGG CCT GCG 
ile gin pro ala gly pro ala 
61/21 

ACA ACA CCT GGA ATG GGG ACC 
thr thr pro gly met gly thr 
121/41 

CCT CGG TCG AGA CGT ATC GCG 
pro arg ser arg arg ile ala 
181/61 

CGC CGC ATA TTT GCC GCG CTG 
arg arg ile phe ala ala leu 









31/11 




CCT 


TCG 


AAT 


CGA CGG 


CCA 


pro 


ser 


asn 


arg arg 


pro 






91/31 




TTT 


TCG 


GTG 


TTG CTG 


GTA 


phe 


ser 


val 


leu leu 


val 






151/51 




GCA 


GCG 


TTG 


GCC CTG 


TCG 


ala 


ala 


leu 


ala leu 


ser 








211/71 




CCG 


CGG 


GCC 


GGA TC 




pro 


arg 


ala 


gly 





SEQ ID N0.34A 
FIGURE 34A 



GGC CAC CGC TCG CTG CCG GCA 
gly his arg ser leu pro ala 

ACC GGG ACA ACC GGC ACC ACG 
thr gly thr thr gly thr thr 

TTG CTG ACA ATT ACC GCT GGC 
leu leu thr ile thr ala gly 



REPLACEMENT SHEET (RULE 26) 



105/185 



1/1 

TCC AAC CTG CTG GGC 
ser asn leu leu gly 
61/21 

CAA CAC CTG GAA TGG 
gin his leu glu trp 
121/41 

CTC GGT CGA GAC GTA 
leu gly arg asp val 
181/61 

GCC GCA TAT TTG CCG 
ala ala tyr leu pro 



CTG CGC CTT CGA ATC 
leu arg leu arg lie 

GGA OCT TTT CGG TGT 
gly pro phe arg cys 

TCG CGG CAG CGT TGG 
ser arg gin arg trp 

CGC TGC CGC GGG CCG 
arg cys arg gly pro 



31/11 

GAC GGC CAG GCC ACC 
asp gly gin ala thr 
91/31 

TGC TGG TAA CCG GGA 
cys trp OCH pro gly 
151/51 

CCC TGT CGT TGC TGA 

pro cys arg cys OPA 

211/71 

GAT C 

asp 



GCT CGC TGC CGG CAA 
ala arg cys arg gin 

CAA CCG GCA CCA CGC 
gin pro ala pro arg 

CAA TTA CCG CTG GCC 
gin leu pro leu ala 



SEQ ID N0.34B 
FIGURE 34B 



1/1 

GAT CCA ACC TGC TGG 
asp pro thr cys trp 
61/21 

AAC AAC ACC TGG AAT 
asn asn thr trp asn 
121/41 

GCC TCG GTC GAG ACG 
ala ser val glu thr 
181/61 

CCG CCG CAT ATT TGC 
pro pro his lie cys 



GCC TGC GCC TTC GAA 
ala cys ala phe glu 

GGG GAC CTT TTC GGT 
gly asp leu phe gly 

TAT CGC GGC AGC GTT 
tyr arg gly ser val 

CGC GCT GCC GCG GGC 
arg ala ala ala gly 



31/11 

TCG ACG GCC AGG CCA 
ser thr ala arg pro 
91/31 

GTT GCT GGT AAC CGG 
val ala gly asn arg 
151/51 

GGC CCT GTC GTT GCT 
gly pro val val ala 
211/71 
CGG ATC 
arg ile 



CCG CTC GCT GCC GGC 

pro leu ala ala gly 

GAC AAC CGG CAC CAC 

asp asn arg his his 

GAC AAT TAC CGC TGG 

asp asn tyr arg trp 



SEQ ID N0.34C 
FIGURE 34C 



REPLACEMENT SHEET (RULE 26) 



106/185 

ORF according to Cole et al . (Nature 393:537-544) containing seq34A 



1/1 

tag cog cag ggc 
AMB pro gin gly 
61/21 

ttg cag cag tta 
leu gin gin leu 
121/41 

agg gga gac gtt 
arg gly asp val 
181/61 

cca ggc cac cgc 
pro gly his arg 
241/81 

gta acc ggg aca 
val thr gly thr 
301/101 

teg ttg ctg aca 
ser leu leu thr 
361/121 

agg teg acc tgc 
arg ser thr cys 
421/141 

gcg acg gca ggc 
ala thr ala gly 
481/161 

etc acg ctt ggc 
leu thr leu gly 



cct gcg get agg 
pro ala ala arg 

caa cgc caa atg 
gin arg gin met 

gcg cag cga egg 
ala gin arg arg 

teg ctg ecg gca 
ser leu pro ala 

acc ggc acc acg 
thr gly thr thr 

att acc get ggc 
ile thr ala gly 

cag ate tea ecg 
gin ile ser pro 

cca etc tet tgg 
pro leu ser trp 

ttc atg ecg cag 
phe met pro gin 







31/11 


cgc 


ggc 


egg tgc 


arg 




arg cys 






91/31 




tct 


craa CQC 


glu 


ser 


glu arg 






151/51 


ate 


caa 


cct get 


ile 


gin 


pro ala 






211/71 


aca 


aca 


cct ggs 


thr 


thr 


pro gly 






271/91 


cct 


CQQ 


teg aga 


pro 


arg 


ser arg 






331/111 


cgc 


cgc 


ata ttt 


arg 


arg 


ile phe 






391/131 


cgc 


age 


ate tac 


arg 


ser 


ile tyr 






451/151 


cat 


gcg 


tec aat 


his 


ala 


ser asn 






511/171 


tee 


tac 


cca tgt 


ser 


tyr 


pro cys 



cgt tgg ecg egg 
arg trp pro arg 

ate gtc gag ttc 
ile val glu phe 

ggg cct gcg cct 
gly pro ala pro 

atg ggg acc ttt 
met gly thr phe 

cgt ate gcg gca 
arg ile ala ala 

gee gcg ctg ecg 
ala ala leu pro 

gee gtt cgc tgc 
ala val arg cys 

get gcg acg tec 
ala ala thr ser 

agt aae aga tag 
ser asn arg AMB 



egg caa teg atg 
arg gin ser met 

gat cag etc ggc 
asp gin leu gly 

teg aat cga egg 
ser asn arg arg 

teg gtg ttg ctg 
ser val leu leu 

gcg ttg gee ctg 
ala leu ala leu 

egg gee gga tec 
arg ala gly ser 

aaa ecg ecg act 
lys pro pro thr 

teg gta gac aag 
ser val asp lys 



SEQ ID N0.34F 



FIGURE 34F 



1/1 

CAG TCT GTC GGC AAG 
gin ser val gly lys 
61/21 

CAG ATC GAG AGC GCT 
gin ile glu ser ala 
121/41 

GGC TTC CGC GCA CCG 
gly phe arg ala pro 
181/61 

CTG GGG ATG TTG GTT 
leu gly met leu val 
241/81 

CTC AGC GTT TTC GGT 
leu ser val phe gly 
301/101 

CGG TTG TCC GGC AGG 
arg leu ser gly arg 
361/121 

ACC AAG GGG GCC GGG 
thr lys gly ala gly 



GAG GGA CGC ATG CCA 
glu gly arg met pro 

CTC TAC GCC GAA GAT 
leu tyr ala glu asp 

ACC GCG CGG CGG CGC 
thr ala arg arg arg 

TCC GGC GTG GCG TTC 
ser gly val ala phe 

TTT GTC GTG ATG TTC 
phe val val met phe 

ATG GAT CGT GGC GGA 
met asp arg gly gly 

GGC TCA TTC ACC AGC 
gly ser phe thr ser 



31/11 

CTC TCC GAT CAT GAG 
leu ser asp his glu 
91/31 

CCC AAG TTC GCA TCG 
pro lys phe ala ser 
151/51 

CTG CAG GGC GCG GCG 
leu gin gly ala ala 
211/71 

AAA GAG ACC ATG ATC 
lys glu thr met ile 
271/91 

GGT GGT GTG GTG TAT 
gly gly val val tyr 
331/111 

TCG GCT GCT GGG GCT 
ser ala ala gly ala 
391/131 

CGT ATG GAA GAT C 
arg met glu asp 



CAG CGG ATG CTT GAC 
gin arg met leu asp 

AGT GTC CGT GGC GGG 
ser val arg gly gly 

TTG TTC ATC ATC GGT 
leu phe ile ile gly 

GGA AGT TTC CCG ATA 
gly ser phe pro ile 

GCC ATC ACC GGT CCT 
ala ile thr gly pro 

TCG CGC CAG CGT CGT 
ser arg gin arg arg 



SEQ ID No,35A 



FIGURE 35A 
REPLACEMENT SHEET (RULE 2^ 



107/185 



1/1 31/11 

GAC ACT CTG TCG GCA AGG AGG GAC GCA TGC CAC TCT CCG ATC ATG AGC AGC GGA TGC TTG 
asp ser leu ser ala arg arg asp ala cys his ser pro ile met ser ser gly cys leu 
61/21 91/31 

ACC AGA TCG AGA GCG CTC TCT ACG CCG AAG ATC CCA AGT TCG CAT CGA GTG TCC GTG GCG 
thr arg ser arg ala leu ser thr pro lys ile pro ser ser his arg val ser val ala 
121/41 151/51 

GGG GCT TCC GCG CAC CGA CCG CGC GGC GGC GCC TGC AGG GCG CGG CGT TGT TCA TCA TCG 
gly ala ser ala his arg pro arg gly gly ala cys arg ala arg arg cys ser ser ser 
181/61 211/71 

GTC TGG GGA TGT TGG TTT CCG GCG TGG CGT TCA AAG AGA CCA TGA TCG GAA GTT TCC CGA 
val trp gly cys trp phe pro ala trp arg ser lys arg pro OPA ser glu val ser arg 
241/81 271/91 

TAC TCA GCG TTT TCG GTT TTG TCG TGA TGT TCG GTG GTG TGG TGT ATG CCA TCA CCG GTC 
tyr ser ala phe ser val leu ser OPA cys ser val val trp cys met pro ser pro val 
301/101 331/111 

CTC GGT TGT CCG GCA GGA TGG ATC GTG GCG GAT CGG CTG CTG GGG CTT CGC GCC AGC GTC 

leu gly cys pro ala gly trp ile val ala asp arg leu leu gly leu arg ala ser val 

361/121 391/131 

GTA CCA AGG GGG CCG GGG GCT CAT TCA CCA GCC GTA TGG AAG ATC 

val pro arg gly pro gly ala his ser pro ala val trp lys ile 



SEQ ID N0.35B 
FIGURE 35B 



1/1 31/11 

ACA GTC TGT CGG CAA GGA GGG ACG CAT GCC ACT CTC CGA TCA TGA GCA GCG GAT GCT TGA 
thr val cys arg gin gly gly thr his ala thr leu arg ser OPA ala ala asp ala OPA 
61/21 91/31 

CCA GAT CGA GAG CGC TCT CTA CGC CGA AGA TCC CAA GTT CGC ATC GAG TGT CCG TGG CGG 
pro asp arg glu arg ser leu arg arg arg ser gin val arg ile glu cys pro trp arg 
121/41 151/51 

GGG CTT CCG CGC ACC GAC CGC GCG GCG GCG CCT GCA GGG CGC GGC GTT GTT CAT CAT CGG 
gly leu pro arg thr asp arg ala ala ala pro ala gly arg gly val val his his arg 
181/61 211/71 

TCT GGG GAT GTT GGT TTC CGG CGT GGC GTT CAA AGA GAC CAT GAT CGG AAG TTT CCC GAT 
ser gly asp val gly phe arg arg gly val gin arg asp his asp arg lys phe pro asp 
241/81 271/91 

ACT CAG CGT TTT CGG TTT TGT CGT GAT GTT CGG TGG TGT GGT GTA TGC CAT CAC CGG TCC 
thr gin arg phe arg phe cys arg asp val arg trp cys gly val cys his his arg ser 
301/101 331/111 

TCG GTT GTC CGG CAG GAT GGA TCG TGG CGG ATC GGC TGC TGG GGC TTC GCG CCA GCG TCG 

ser val val arg gin asp gly ser trp arg ile gly cys trp gly phe ala pro ala ser 

361/121 391/131 

TAC CAA GGG GGC CGG GGG CTC ATT CAC CAG CCG TAT GGA AGA TC 

tyr gin gly gly arg gly leu ile his gin pro tyr gly arg 



SEQ ID N0.35C 
FIGURE 35C 
REPLACEMENT SHEET (RULE 26) 



108/185 



sequence Rv2169c predicted by Cole et al . (Nature 393:537-544} and partially 
containing seq35A 



1/1 


















31/11 




















etc 


tec 


gat 


cat 




cag 


cao 


atg 


ctt gac 


cag 


ate 


aaa 


age 


get 


etc 


tac 


gcc 


Met pro 


leu 


ser 


asp 


his 


glu 


gin 


arg 


met 


leu asp 


aln 


ile 


glu 


ser 


ala 


leu 


t vr 


ala 


61/21 


















91/31 


















gaa gat 


ccc 


aag 


ttc 


aca 


tea 


aat 


ate 


cat 


aae aaa 


aae 


ttc 


eac 


aca 


cca 


acc 


aca 


caa 
^y y 


glu asp 


pro 


1 vs 
J. u 


phe 


ala 


ser 


ser 


val 


ara 


alv aly 


alv 


phe 


ara 


ala 


pro 


thr 


ala 


arg 


121/41 


















151/51 


















egg cgc 


ctg 


cag 


ggc 


gcg 


gcg 


ttg 


ttc 


ate 


ate ggt 


ctg 


ggg 


atg 


ttg 


gtt 


tec 


ggc 


ata 


arg arg 


leu 


gin 


gly 


ala 


ala 


leu 


phe 


ile 


ile gly 


leu 


gly 


met 


leu 


val 


ser 


gly 


val 


181/61 


















211/71 


















gcg ttc 


aaa 


gag 


acc 


atg 


ate 


gga 


agt 


ttc 


ccg ata 


etc 


age 


gtt 


ttc 


ggt 


ttt 


gtc 


gtg 


ala phe 


lys 


glu 


thr 


met 


ile 


gly 


ser 


phe 


pro ile 


leu 


ser 


val 


phe 


gly 


phe 


val 


val 


241/81 


















271/91 


















_ atg ttc 


ggt 


ggt 


gtg 


gtg 


tat 


gee 


ate 


acc 


ggt cct 


egg 


ttg 


tec 


ggc 


agg 


atg 


gat 


cgt 


-1 met phe 


gly 


gly 


val 


val 


tyr 


ala 


ile 


thr 


gly pro 


arg 


leu 


ser 


gly 


arg 


met 


asp 


arg 


D 301/101 


















331/111 


















ggc gga 


teg 


get 


get 


ggg 


get 


teg 


cgc 


cag 


cgt cgt 


acc 


aag 


ggg 


gcc 


ggg 


ggc 


tea 


ttc 


gly giy 


ser 


ala 


ala 


gly 


ala 


ser 


arg 


gin 


arg arg 


thr 


lys 


gly 


ala 


gly 


gly 


ser 


phe 


361/121 


















391/131 


















acc age 


cgt 


atg 


gaa 


gat 


egg 


ttc 


egg 


cgc 


cgc ttc 


gac 


gag 


taa 












thr ser 


arg 


met 


glu 


asp 


arg 


phe 


arg 


arg 


arg phe 


asp 


glu 


OCH 













SEQ ID N0.35D 
FIGURE 35D 



ORF according to Cole et al- (Nature 393:537-544) and containing Rv2169c 



1/1 


















31/11 


















tga cag 


tct 


gtc 


ggc 


aag 


gag 


gga 


cgc 


atg 


cca etc 


tec 


gat 


cat 


gag 


cag 


egg 


atg 


ctt 


OPA gin 


ser 


val 


gly 


lys 


glu 


gly 


arg 


met 


pro leu 


ser 


asp 


his 


glu 


gin 


arg 


met 


leu 


61/21 


















91/31 


















gac cag 


ate 


gag 


age 


get 


etc 


tac 


gcc 


gaa 


gat ccc 


aag 


ttc 


gca 


teg 


agt 


gtc 


cgt 


ggc 


asp gin 


ile 


glu 


ser 


ala 


leu 


tyr 


ala 


glu 


asp pro 


lys 


phe 


ala 


ser 


ser 


val 


arg 


gly 


121/41 


















151/51 


















ggg ggc 


ttc 


cgc 


gca 


ccg 


acc 


gcg 


egg 


egg 


cgc ctg 


cag 


ggc 


gcg 


gcg 


ttg 


ttc 


ate 


ate 


gly gly 


phe 


arg 


ala 


pro 


thr 


ala 


arg 


arg 


arg leu 


gin 


gly 


ala 


ala 


leu 


phe 


ile 


ile 


181/61 


















211/71 


















ggt ctg 


ggg 


atg 


ttg 


gtt 


tee 


ggc 


gtg 


gcg 


ttc aaa 


gag 


acc 


atg 


ate 


gga 


agt 


ttc 


ccg 


gly leu 


gly 


met 


leu 


val 


ser 


gly 


val 


ala 


phe lys 


glu 


thr 


met 


ile 


gly 


ser 


phe 


pro 


241/81 


















271/91 


















ata etc 


age 


gtt 


ttc 


ggt 


ttt 


gtc 


gtg 


atg 


ttc ggt 


ggt 


gtg 


gtg 


tat 


gee 


ate 


acc 


ggt 


ile leu 


ser 


val 


phe 


gly 


phe 


val 


val 


met 


phe gly 


gly 


val 


val 


tyr 


ala 


ile 


thr 


gly 


301/101 


















331/111 


















cct egg 


ttg 


tec 


ggc 


agg 


atg 


gat 


cgt 


ggc 


gga teg 


get 


get 


ggg 


get 


teg 


cgc 


cag 


cgt 


pro arg 


leu 


ser 


gly 


arg 


met 


asp 


arg 


gly 


gly ser 


ala 


ala 


gly 


ala 


ser 


arg 


gin 


arg 


361/121 


















391/131 


















cgt acc 


aag 


ggg 


gcc 


ggg 


ggc 


tea 


ttc 


acc 


age cgt 


atg 


gaa 


gat 


egg 


ttc 


egg 


cgc 


cgc 


arg thr 


lys 


gly 


ala 


gly 


gly 


ser 


phe 


thr 


ser arg 


met 


glu 


asp 


arg 


phe 


arg 


arg 


arg 



421/141 

ttc gac gag taa 
phe asp glu OCH 

SEQ ID 35F 
FIGURE 35F 

REPLACEMENT SHEET (RULE 26) 



109/185 



1/1 

GAC CTG GGA CGA AGA CGA CGG 

asp leu gly arg arg arg arg 
61/21 

CAA TGG ACA CCC GAC TAG GGT 

gin trp thr pro asp tyr gly 
121/41 

GTG GTC GGA GTG CCA CCC GAC 

val val gly val pro pro asp 
181/61 

ACG GCG CCG GTC TGG ATC ACC 

thr ala pro val trp ile thr 
241/81 

GGG GCA CCC GGT GGC ACC GTT 

gly ala pro gly gly thr val 
301/101 

GCC AAC GGC GGC CGG CTG GAC 

ala asn gly gly arg leu asp 
361/121 

GTC GAC CCC GAC GGC GCC GTC 

val asp pro asp gly ala val 









31/11 




CAG 


CAG 


CCG 


CAA TCA 


GAT 


gin 


gin 


pro 


gin ser 


asp 








91/31 




GCG 


CCT 


GCG 


CGG CTC 


GAC 


ala 


p ro 


ala 


arg leu 


asp 








151/51 




CAG 


GCC 


ACC 


GAC TTC 


GGC 


gin 


ala 


thr 


asp ohe 


alv 








211/71 




ATG 


CTG 


TGG 


CCG CTG 


GCC 


met 


leu 


trp 


pro leu 


ala 








271/91 




CCC 


GTC 


CGG 


CTG GTC 


GAC 


pro 


val 


arg 


leu val 


asp 








331/111 




ATC 


CTC 


CTG 


TCG GCG 


GCC 


ile 


leu 


leu 


ser ala 


ala 








391/131 




GGC 


CGA 


GCG 


CTG TGC 


CTG 


gly 


arg 


ala 


leu cys 


leu 



SEQ ID N0.36A 
FIGURE 36A 



CTA CCC GGT CCT GGT CAA CGT 
leu pro gly pro gly gin arg 

AAT GCG CGG TTC CTG TTG CCC 
asn ala arg phe leu leu pro 

TCC GCT GTT GCA CCA GAA ACG 
ser ala val ala pro glu thr 

GAC CGG CCC CGG TTG GCC CCC 
asp arg pro arg leu ala pro 

GAC GAC CTG GCA AAC TCG CTG 
asp asp leu ala asn ser leu 

GAG TTC GCC ACC AAC CGG GAA 
glu phe ala thr asn arg glu 

GCC ATC GAC CCA GAT C 
ala ile asp pro asp 



1/1 


















31/11 


















ACC TGG 


GAC 


GAA 


GAC 


GAC 


GGC 


AGC 


AGC 


CGC 


AAT CAG 


ATC 


TAG 


CCG 


GTC 


CTG 


GTC 


AAC 


GTC 


thr trp 


asp 


glu 


asp 


asp 


gly 


ser 


ser 


arg 


asn gin 


ile 


tyr 


pro 


val 


leu 


val 


asn 


val 


61/21 


















91/31 


















AAT GGA 


CAC 


CCG 


ACT 


ACG 


GTG 


CGC 


CTG 


CGC 


GGC TCG 


ACA 


ATG 


CGC 


GGT 


TCC 


TGT 


TGC 


CCG 


asn gly 


his 


pro 


thr 


thr 


val 


arg 


leu 


arg 


gly ser 


thr 


met 


arg 


gly 


ser 


cys 


cys 


pro 


121/41 


















151/51 


















TGG TCG 


GAG 


TGC 


CAC 


CCG 


ACC 


AGG 


CCA 


CCG 


ACT TCG 


GCT 


CCG 


CTG 


TTG 


CAC 


CAG 


AAA 


CGA 


trp ser 


glu 


cys 


his 


pro 


thr 


arg 


pro 


pro 


thr ser 


ala 


pro 


leu 


leu 


his 


gin 


lys 


arg 


181/61 


















211/71 


















CGG CGC 


CGG 


TCT 


GGA 


TCA 


CCA 


TGC 


TGT 


GGC 


CGC TGG 


CCG 


ACC 


GGC 


CCC 


GGT 


TGG 


CCC 


CCG 


arg arg 


arg 


ser 


gly 


ser 


pro 


cys 


cys 


gly 


arg trp 


pro 


thr 


gly 


pro 


gly 


trp 


pro 


pro 


241/81 


















271/91 


















GGG CAC 


CCG 


GTG 


GCA 


CCG 


TTC 


CCG 


TCC 


GGC 


TGG TCG 


ACG 


ACG 


ACC 


TGG 


CAA 


ACT 


CGC 


TGG 


gly his 


pro 


val 


ala 


pro 


phe 


pro 


ser 


gly 


trp ser 


thr 


thr 


thr 


trp 


gin 


thr 


arg 


trp 


301/101 


















331/111 


















CCA ACG 


GCG 


GCC 


GGC 


TGG 


ACA 


TCC 


TCC 


TGT 


CGG CGG 


CCG 


AGT 


TCG 


CCA 


CCA 


ACC 


GGG 


AAG 


pro thr 


ala 


ala 


gly 


trp 


thr 


ser 


ser 


cys 


arg arg 


pro 


ser 


ser 


pro 


pro 


thr 


gly 


lys 


361/121 


















391/131 


















TCG ACC 


CCG 


ACG 


GCG 


CCG 


TCG 


GCC 


GAG 


CGC 


TGT GCC 


TGG 


CCA 


TCG 


ACC 


CAG 


ATC 






ser thr 


pro 


thr 


ala 


pro 


ser 


ala 


glu 


arg 


cys ala 


trp 


pro 


ser 


thr 


gin 


ile 







SEQ ID N0.36B 

FIGURE 36B 

REPLACE MENT SHEET (RULE 26) 
mmmmmmmwmmmmmw 



110/185 



1/1 


















31/11 


















CCT GGG 


ACG 


AAG 


ACG 


ACG 


GCA 


GCA 


GCC 


GCA 


ATG AGA 


TCT 


ACC 


CGG 


TCC 


TGG 


TCA 


ACG 


TCA 


pro gly 


thr 


lys 


thr 


thr 


ala 


ala 


ala 


ala 


ile arg 


ser 


thr 


arg 


ser 


trp 


ser 


thr 


ser 


61/21 


















91/31 


















ATG GAG 


ACC 


CGA 


CTA 


CGG 


TGC 


GCC 


TGC 


GCG 


GCT CGA 


CAA 


TGC 


GCG 


GTT 


CCT 


GTT 


GCC 


CGT 


itiet asp 


thr 


arg 


leu 


arg 


cys 


ala 


cys 


ala 


ala arg 


gin 


cys 


ala 


val 


pro 


val 


ala 


arg 


121/41 


















151/51 


















GGT CGG 


AGT 


GCC 


ACC 


CGA 


CCA 


GGC 


CAC 


CGA 


CTT CGG 


CTC 


CGC 


TGT 


TGC 


ACC 


AGA 


AAC 


GAC 


gly arg 


ser 


ala 


thr 


arg 


pro 


gly his 


arg 


leu arg 


leu 


arg 


cys 


cys 


thr 


arg 


asn 


asp 


181/61 


















211/71 


















GGG GCC 


GGT 


CTG 


GAT 


CAC 


CAT 


GCT 


GTG 


GCC 


GCT GGC 


CGA 


CCG 


GCC 


CCG 


GTT 


GGC 


CCC 


CGG 


gly ala 


gly 


leu 


asp 


his 


his 


ala 


val 


ala 


ala gly arg pro 


ala 


pro 


val 


gly 


pro 


arg 


241/81 


















271/91 


















GGG ACC 


CGG 


TGG 


CAC 


CGT 


TCC 


CGT 


CCG 


GCT 


GGT CGA 


CGA 


CGA 


CCT 


GGC 


AAA 


CTC 


GCT 


GGC 


gly thr 


arg 


trp 


his 


arg 


ser 


arg 


pro 


ala 


gly arg 


arg 


arg 


pro 


gly 


lys 


leu 


ala 


gly 


301/101 


















331/111 


















CAA CGG 


CGG 


CCG 


GCT 


GGA 


CAT 


CCT 


CCT 


GTC 


GGC GGC 


CGA 


GTT 


CGC 


CAC 


CAA 


CCG 


GGA 


AGT 


gin arg 


arg 


pro 


ala 


crlv 


his 


pro 


pro 


val 


gly gly 


arg 


val 


ara 


his 


gin 


pro 


crlv 


ser 


361/121 


















391/131 


















CGA GCC 


CGA 


CGG 


CGC 


CGT 


CGG 


CCG 


AGC 


GCT 


GTG CCT 


GGC 


CAT 


CGA 


CCC 


AGA 


TC 






arg pro 


arg 


arg 


arg 


arg 


arg 


pro 


ser 


ala 


val pro 


gly 


his 


arg 


pro 


arg 






















SEQ 


ID 


No. 36 


c 






























FIGURE 36C 


















Coding 


sequence 


Rv3909 


predicted by Cole et 


al . , 


1998 


(Nature 


393 


537 


-544 


containing Seq 36A 






























1/1 


















31/11 


















GTG ACC 


GCA 


CTG 


CAA 


CTC 


GGC 


TGG 


GCC 


GCT 


TTG GCG 


CGC 


GTC 


ACC 


TCA 


GCG 


ATC 


GGC 


GTC 


met thr 


ala 


leu 


gin 


leu 


gly 


trp 


ala 


ala 


leu ala 


arg 


val 


thr 


ser 


ala 


ile 


gly 


val 


61/21 


















91/31 


















GTG GCC 


GGC 


CTC 


GGG 


ATG 


GCG 


CTC 


ACG 


GTA 


CCG TCG 


GCG 


GCA 


CCG 


CAC 


GCG 


CTC 


GCA 


GGC 


val ala 


gly 


leu 


gly 


met 


ala 


leu 


thr 


val 


pro ser 


ala 


ala 


pro 


his 


ala 


leu 


ala 


gly 


121/41 


















151/51 


















GAG CCC 


AGC 


CCG 


ACG 


CCT 


TTT 


GTC 


CAG 


GTC 


CGC ATC 


GAT 


CAG 


GTG 


ACC 


CCG 


GAC 


GTG 


GTG 


glu pro 


ser 


pro 


thr 


pro 


phe 


val 


gin 


val 


arg ile 


asp 


gin 


val 


thr 


pro 


asp 


val 


val 


181/61 


















211/71 


















ACC ACT 


TCC 


AGC 


GAA 


CCC 


CAT 


GTC 


ACC 


GTC 


AGC GGA 


ACG 


GTG 


ACC 


AAT 


ACC 


GGT 


GAC 


CGC 


thr thr 


ser 


ser 


glu 


pro 


his 


val 


thr 


val 


ser gly 


thr 


val 


thr 


asn 


thr 


gly 


asp 


arg 


241/81 


















271/91 


















CCA GTC 


CGC 


GAT 


GTG 


ATG 


GTC 


CGG 


CTT 


GAG 


CAC GCC 


GCC 


GCG 


GTC 


ACG 


TCG 


TCA 


ACG 


GCG 


pro val 


arg 


asp 


val 


met 


val 


arg 


leu 


glu 


his ala 


ala 


ala 


val 


thr 


ser 


ser 


thr 


ala 


301/101 


















331/111 


















TTA CGC 


ACC 


TCG 


CTC 


GAG 


GGC 


GGC 


ACC 


GAG 


CAG TAC 


CAG 


CCG 


GCC 


GCG 


GAC 


TTC 


CTC 


ACG 


leu arg 


thr 


ser 


leu 


asp 


gly 


gly 


thr 


asp 


gin tyr 


gin 


pro 


ala 


ala 


asp 


phe 


leu 


thr 



SEQ ID N0.36D 
FIGURE 36D 



REPLACEMENT SHEET (RULE 26) 



111/185 



o D -L / J. ± 










CCC 




CTA 


V3.1 3,13 


pro 


glu 


i eu 


O T / 1 /I T 

^ ZLJ 14 1 








1 Ulj do 








ser leu 


■t- K V 


arg 


pro 


fi O X / J- D X 
















val asn 


gly 


4- V. -w- 


pro 


O 1/ 1 D 1 
















plTO V3.1 


val 


gxy 


val 


O U X / Z. ^ X 
















cnx tflx 


aia 


pro 


vax 


O D X / ^ <:! X 










GCA 


CCC 


GGT 


pj_0 yj-y 


dx a. 


p ro 




"7 91 /94 1 

/ ^ X / *± X 
















l©u 3.I3 


asn 


gxy 


gxy 


7 1 /9fi1 

f 0 X / il. U X 








GAA GTC 


GAC 




GAC 


glu val 


asp 






841/281 








CTC ATC 


ACC 




A AT 


leu ile 


thr 


V ct X 


asn 


901/301 








GCT CAA 


CTA 






ala gin 


leu 


pro 


cfly 


961/321 








CTG GAT 


CGA 


i i Vj 




leu asp 


arg 


leu 


arg 


1021/341 






GCC GAG 


CTG 


(Zli.'V 




ala asp 


leu 


asp 


ala 


1081/361 






AGO CCC 


GCC 


car* 


ATP 


ser pro 


ala 


asp 


X X c 


1141/381 






CTG CCC 


GAC 






leu pro 


asp 


rfl \7 
yxy 


pro 


1201/401 






GTT GCC 


GTC 


GCG 


GCC 


val ala 


val 


ala 


ala 


1261/421 






TCC GCG 


CTC 


TTA 


CCC 


ser ala 


leu 


leu 


pro 


1321/441 






TTT GAT 


CCC 


GCG 


GTC 


phe asp 


pro 


ala 


val 


1381/461 






TAT CTA 


GAT 


CCC 


TCG 


tyr leu 


asp 


pro 


ser 



GAC CGC GGG CAA GAG 
asp arg gly gin glu 

TCG TTG GCC GTC AAC 
ser leu ala val asn 

GAC TAG GGT GCG CCT 
asp tyr gly ala pro 

CCA CCC GAC CAG GCC 
pro pro asp gin ala 

TGG ATC ACC ATG CTG 
trp ile thr met leu 

GGC ACC GTT CCC GTC 
gly thr val pro val 

CGG CTG GAC ATC CTC 
arg leu asp ile leu 

GGC GCC GTC GGC CGA 
gly ala val gly arg 

GCG ATG ACC GGC GGC 
ala met thr gly gly 

ACC CCG ACC CAC CCG 
thr pro thr his pro 

ACG CTA GTC CAC CGG 
thr leu val his arg 

TTG CAG CGG GTT AAT 
leu gin arg val asn 

GTC GAC CGC ATC CTG 
val asp arg ile leu 

TTG ACC GGC CGG GCG 
leu thr gly arg ala 

GCC GAT TTT AGC CCC 
ala asp phe ser pro 

GCT ACC GCG CCC CGG 
ala thr ala pro arg 

GGG GCC GCG CTG GCC 
gly ala ala leu ala 

TTG TTC GTT CGG ATC 
leu phe val arg ile 



391/131 








GCC GGC 


TTT 


ACC 


CTC 


ala gly phe 


L.ii X 


leu 


>i c, -1 / -1 C -I 
*i 0 X/ X D X 








PAP PPP 


PPP 


ATP 


TAP 


gin pro 


gly 


ile 


L-yx 


0 1 1/ 1 / 1 










CTC 


GAC 


AAT 


ala arg 


leu 


asp 


asn 


0 / I/ 1^1 








TV r*r^ f^Tsc 
AOL, (oAU 






TPP 


thr asp 


phe 


^ly 


ser 


DO X / ^1 X X 








rr^firi ppp 


Vj. 1 0 






trp pro 


1 eu 


ala 


asp 


691/23 1 








cr^c PTP 


PTP 


GAC 


GAC 


arg leu 


vax 


asp 


a sp 


71:: -1 /OCL-1 








CTG TCG 


GCG 


GCC 


GAG 


leu ser 


ala 


ala 


glu 


811/271 








GCG CTG 


TGC 


CTG 


GCC 


ala leu 


cys 


leu 


ala 


871/291 








TAG GTC 


GTG 


TCC 


GAC 


tyr val 


val 


ser 


asp 


931/311 








GGC ACC 


GGC 


PA^^ 


CCC 


gly thr 


gly 


gin 


ala 


991/331 








ACA TGC 


GTG 




CCC 


thr cys 


val 


UilX 


pro 


1051/351 






GAT CCG 


AGG 


CTG 


AGC 


asp pro 


arg 


leu 


ser 


1111/371 






GAT GTC 


AGC 


TCC 


ACC 


asp val 


ser 


ser 


1-hr 


1171/391 






ATC AAC 


TTG 


CTC 


AGC 


ile asn 


leu 


leu 


ser 


1231/411 






GAG GAA 


CAG 


CAG 


GGT 


glu glu 


gin 


gin 


gly 


1291/431 






CGG TTG 


TCC 


CCG 


CGG 


arg leu 


ser 


pro 


arg 


1351/451 






GCC GCG 


GGA 


ACA 


AAC 


ala ala 


gly 


thr 


asn 


1411/471 






GCG CAT 


GAA 


TCG 


ATC 


ala his 


glu 


ser 


ile 



TCG GCC CCG CTG CGC 
ser ala pro leu arg 

CCG GTC CTG GTC AAC 
pro val leu val asn 

GCG CGG TTC CTG TTG 
ala arg phe leu leu 

GCT GTT GCA CCA GAA 
ala val ala pro glu 

CGG CCC CGG TTG GCC 
arg pro arg leu ala 

GAC CTG GCA AAC TCG 
asp leu ala asn ser 

TTC GCC ACC AAC CGG 
phe ala thr asn arg 

ATC GAC CCA GAT CTA 
ile asp pro asp leu 

TCG CCC GAC GGG GCC 
ser pro asp gly ala 

GCC GCA TCC AGC TGG 
ala ala ser ser trp 

CTG CCT TTT GCC CAA 
leu pro phe ala gin 

GCG ATC GCA ACC ATC 
ala ile ala thr ile 

CGC GGC GCA ACC GTG 
arg gly ala thr val 

ACC CAC GGC AAC ACG 
thr his gly asn thr 

TCG TCC CAG ATC GGC 
ser ser gin ile gly 

GTG GTA GCG GCG CCG 
val val ala ala pro 

CCG ACC GTT CCT ACC 
pro thr val pro thr 

ACC GCG CGC CGC CAG 
thr ala arg arg gin 



SEQ ID No. 3 6D (continued 1) 

FIGURE 36D (continued 1) 
REPLACEMENT SHEET (RULE 26) 



112/185 



1441/481 1471/491 

GAC GCC TTG GGC GCA ATG CTG TGG CGC AGC TTG GAG CCG AAT GCC GCG CCC CGT ACC CAA 
asp ala leu gly ala met leu trp arg ser leu glu pro asn ala ala pro arg thr gin 
1501/501 1531/511 

ATC CTG GTG CCG CCG GCG TCG TGG AGC CTG GCC AGC GAC GAC GCG CAG GTC ATC CTG ACC 
lie leu val pro pro ala ser trp ser leu ala ser asp asp ala gin val lie leu thr 
1561/521 1591/531 

GCG CTG GCC ACC GCC ATC CGG TCT GGC CTG GCC GTG CCG CGA CCA CTA CCG GCG GTG ATC 
ala leu ala thr ala lie arg ser gly leu ala val pro arg pro leu pro ala val ile 
1621/541 1651/551 

GCT GAC GCC GCG GCC CGC ACC GAG CCA CCG GAA CCC CCG GGC GCT TAC AGC GCC GCT CGC 
ala asp ala ala ala arg thr glu pro pro glu pro pro gly ala tyr ser ala ala arg 
1681/561 1711/571 

GGC CGG TTC AAT GAC GAC ATC ACC ACG CAG ATC GGC GGG CAG GTT GCC CGG CTA TGG AAG 
qlv arg phe asn asp asp ile thr thr gin ile gly gly gin val ala arg leu trp lys 
1741/581 1771/591 

CTG ACC TCG GCG TTG ACC ATC GAT GAC CGC ACC GGG CTG ACC GGC GTG CAG TAC ACC GCA 
leu thr ser ala leu thr ile asp asp arg thr gly leu thr gly val gin tyr thr ala 
1801/601 1831/611 

CCA CTA CGC GAG GAC ATG TTG CGC GCG CTG AGC CAA TCG CTA CCA CCC GAT ACC CGC AAC 
pro leu arg glu asp met leu arg ala leu ser gin ser leu pro pro asp thr arg asn 
1861/621 1891/631 

GGG CTG GCC CAG CAG CGG CTG GCC GTC GTT GGA AAG ACG ATC GAC GAT CTT TTC GGC GCG 
gly leu ala gin gin arg leu ala val val gly lys thr ile asp asp leu phe gly ala 
1921/641 1951/651 

GTG ACC ATC GTC AAC CCG GGC GGC TCC TAC ACT CTG GCC ACC GAG CAC AGT CCG CTG CCG 
val thr ile val asn pro gly gly ser tyr thr leu ala thr glu his ser pro leu pro 
1981/661 2011/671 

TTG GCG CTG CAT AAT GGC CTC GCC GTG CCA ATC CGG GTC CGG CTA CAG GTC GAT GCT CCG 
leu ala leu his asn gly leu ala val pro ile arg val arg leu gin val asp ala pro 
2041/681 2071/691 

CCC GGG ATG ACG GTG GCC GAT GTC GGT CAG ATC GAG CTA CCG CCC GGG TAC CTG CCG CTA 
pro gly met thr val ala asp val gly gin ile glu leu pro pro gly tyr leu pro leu 
2101/701 2131/711 

CGA GTA CCA ATC GAG GTG AAC TTC ACA CAG CGG GTT GCC GTC GAC GTG TCG CTG CGG ACC 
arg val pro ile glu val asn phe thr gin arg val ala val asp val ser leu arg thr 
2161/721 2191/731 

CCC GAC GGC GTC GCG CTG GGT GAA CCG GTG CGG TTG TCG GTG CAC TCC AAC GCC TAC GGC 
pro asp gly val ala leu gly glu pro val arg leu ser val his ser asn ala tyr gly 
2221/741 2251/751 

AAG GTG TTG TTC GCG ATC ACG CTA TCC GCT GCG GCC GTG CTG GTA ACG CTG GCG GGC CGG 
lys val leu phe ala ile thr leu ser ala ala ala val leu val thr leu ala gly arg 
2281/761 2311/771 

CGC CTT TGG CAC CGG TTC CGT GGC CAG CCT GAT CGC GCC GAC CTG GAT CGC CCC GAC CTG 
arq leu trp his arg phe arg gly gin pro asp arg ala asp leu asp arg pro asp leu 
2341/781 2371/791 

CCT ACC GGC AAA CAC GCC CCG CAG CGC CGT GCC GTA GCC AGT CGG GAT GAC GAA AAG CAC 
pro thr gly lys his ala pro gin arg arg ala val ala ser arg asp asp glu lys his 
2401/801 
CGG GTA TGA 
arg val OPA 



SEQ ID N0.36D (continued 2) 
FIGURE 36D (continued 2) 



REPLACEMENT SHEET (RULE 26) 



113/185 



ORF according to Cole et al . , 1998 (Nature 393 537-544) and containing Rv 3909. 
1/1 31/11 



TGA CTC AGC ACC GGG TCA GCA CAA CGG TCC CGG GCC GGG GCC GTG ACC GCA CTG CAA CTC 
OPA leu ser thr gly ser ala gin arg ser arg ala gly ala val thr ala leu gin leu 
61/21 91/31 

GGC TGG GCC GCT TTG GCG CGC GTC ACC TCA GCG ATC GGC GTC GTG GCC GGC CTC GGG ATG 
gly trp ala ala leu ala arg val thr ser ala ile gly val val ala gly leu gly met 
121/41 151/51 

GCG CTC ACG GTA CCG TCG GCG GCA CCG CAC GCG CTC GCA GGC GAG CCC AGC CCG ACG CCT 
ala leu thr val pro ser ala ala pro his ala leu ala gly glu pro ser pro thr pro 
181/61 211/71 

TTT GTC CAG GTC CGC ATC GAT CAG GTG ACC CCG GAC GTG GTG ACC ACT TCC AGC GAA CCC 
phe val gin val arg ile asp gin val thr pro asp val val thr thr ser ser glu pro 
241/81 271/91 

CAT GTC ACC GTC AGC GGA ACG GTG ACC AAT ACC GGT GAC CGC CCA GTC CGC GAT GTG ATG 
his val thr val ser gly thr val thr asn thr gly asp arg pro val arg asp val met 
301/101 331/111 

GTC CGG CTT GAG CAC GCC GCC GCG GTC ACG TCG TCA ACG GCG TTA CGC ACC TCG CTC GAC 
val arg leu glu his ala ala ala val thr ser ser thr ala leu arg thr ser leu asp 
361/121 391/131 

GGC GGC ACC GAC CAG TAC CAG CCG GCC GCG GAC TTC CTC ACG GTC GCC CCC GAA CTA GAC 
gly gly thr asp gin tyr gin pro ala ala asp phe leu thr val ala pro glu leu asp 
421/141 451/151 

CGC GGG CAA GAG GCC GGC TTT ACC CTC TCG GCC CCG CTG CGC TCG CTG ACC AGG CCG TCG 
arg gly gin glu ala gly phe thr leu ser ala pro leu arg ser leu thr arg pro ser 
481/161 511/171 

TTG GCC GTC AAC CAG CCC GGG ATC TAC CCG GTC CTG GTC AAC GTC AAT GGG ACA CCC GAC 
leu ala val asn gin pro gly ile tyr pro val leu val asn val asn gly thr pro asp 
541/181 571/191 

TAC GGT GCG CCT GCG CGG CTC GAC AAT GCG CGG TTC CTG TTG CCC GTG GTC GGA GTG CCA 
tyr gly ala pro ala arg leu asp asn ala arg phe leu leu pro val val gly val pro 
601/201 631/211 

CCC GAC CAG GCC ACC GAC TTC GGC TCC GCT GTT GCA CCA GAA ACG ACG GCG CCG GTC TGG 
pro asp gin ala thr asp phe gly ser ala val ala pro glu thr thr ala pro val trp 
661/221 691/231 

ATC ACC ATG CTG TGG CCG CTG GCC GAC CGG CCC CGG TTG GCC CCC GGG GCA CCC GGT GGC 
ile thr met leu trp pro leu ala asp arg pro arg leu ala pro gly ala pro gly gly 

751/251 

ACC GTT CCC GTC CGG CTG GTC GAC GAC GAC CTG GCA AAC TCG CTG GCC AAC GGC GGC CGG 
thr val pro val arg leu val asp asp asp leu ala asn ser leu ala asn gly gly arg 
781/261 811/271 

CTG GAC ATC CTC CTG TCG GCG GCC GAG TTC GCC ACC AAC CGG GAA GTC GAC CCC GAC GGC 
leu asp ile leu leu ser ala ala glu phe ala thr asn arg glu val asp pro asp gly 
841/281 871/291 

GCC GTC GGC CGA GCG CTG TGC CTG GCC ATC GAC CCA GAT CTA CTC ATC ACC GTC AAT GCG 
ala val gly arg ala leu cys leu ala ile asp pro asp leu leu ile thr val asn ala 
901/301 931/311 

ATG ACC GGC GGC TAC GTC GTG TCC GAC TCG CCC GAC GGG GCC GCT CAA CTA CCG GGC ACC 
met thr gly gly tyr val val ser asp ser pro asp gly ala ala gin leu pro gly thr 
961/321 991/331 

CCG ACC CAC CCG GGC ACC GGC CAG GCC GCC GCA TCC AGC TGG CTG GAT CGA TTG CGG ACG 
pro thr his pro gly thr gly gin ala ala ala ser ser trp leu asp arg leu arg thr 



SEQ ID N0.36F 
FIGURE 36F 
REPLACEMENT SHEET (RULE 26) 



114/185 



1021/341 

CTA GTC CAC CGG ACA TGC GTG ACG CCG CTG 
leu val his arg thr cys val thr pro leu 
1081/361 

CAG CGG GTT AAT GAT CCG AGG CTG AGC GCG 
gin arg val asn asp pro arg leu ser ala 
1141/381 

GAC CGC ATC CTG GAT GTC AGC TCC ACC CGC 
asp arg ile leu asp val ser ser thr arg 
1201/401 

ACC GGC CGG GCG ATC AAC TTG CTC AGC ACC 
thr gly arg ala ile asn leu leu ser thr 
1261/421 

GAT TTT AGC CCC GAG GAA CAG CAG GGT TCG 
asp phe ser pro glu glu gin gin gly ser 
1321/441 

ACC GCG CCC CGG CGG TTG TCC CCG CGG GTG 
thr ala pro arg arg leu ser pro arg val 
1381/461 

GCC GCG CTG GCC GCC GCG GGA ACA AAC CCG 
ala ala leu ala ala ala gly thr asn pro 
1441/481 

TTC GTT CGG ATC GCG CAT G7^ TCG ATC ACC 
phe val arg ile ala his glu ser ile thr 
1501/501 

CTG TGG CGC AGC TTG GAG CCG AAT GCC GCG 
leu trp arg ser leu glu pro asn ala ala 
1561/521 

TCG TGG AGC CTG GCC AGC GAC GAC GCG CAG 
ser trp ser leu ala ser asp asp ala gin 
1621/541 

CGG TCT GGC CTG GCC GTG CCG CGA CCA CTA 
arg ser gly leu ala val pro arg pro leu 
1681/561 

ACC GAG CCA CCG GAA CCC CCG GGC GCT TAC 
thr glu pro pro glu pro pro gly ala tyr 
1741/581 

ATC ACC ACG CAG ATC GGC GGG CAG GTT GCC 
ile thr thr gin ile gly gly gin val ala 
1801/601 

ATC GAT GAC CGC ACC GGG CTG ACC GGC GTG 
ile asp asp arg thr gly leu thr gly val 
1861/621 

TTG CGC GCG CTG AGC CAA TCG CTA CCA CCC 
leu arg ala leu ser gin ser leu pro pro 
1921/641 

CTG GCC GTC GTT GGA AAG ACG ATC GAC GAT 
leu ala val val gly lys thr ile asp asp 
1981/661 

GGC GGC TCC TAC ACT CTG GCC ACC GAG CAC 
gly gly ser tyr thr leu ala thr glu his 



SEQ ID 36F 
FIGURE 36F 



1051/351 

CCT TTT GCC CAA GCC GAC CTG GAT GCT TTG 
pro phe ala gin ala asp leu asp ala leu 
1111/371 

ATC GCA ACC ATC AGC CCC GCC GAC ATC GTC 
ile ala thr ile ser pro ala asp ile val 
1171/391 

GGC GCA ACC GTG CTG CCC GAC GGC CCG TTG 
gly ala thr val leu pro asp gly pro leu 
1231/411 

CAC GGC AAC ACG GTT GCC GTC GCG GCC GCC 

his gly asn thr val ala val ala ala ala 
1291/431 

TCC CAG ATC GGC TCC GCG ^CTC TTA CCC GCT 
ser gin ile gly ser ala leu leu pro ala 
1351/451 

GTA GCG GCG CCG TTT GAT CCC GCG GTC GGG 
val ala ala pro phe asp pro ala val gly 
1411/471 

ACC GTT CCT ACC TAT CTA GAT CCC TCG TTG 
thr val pro thr tyr leu asp pro ser leu 
1471/491 

GCG CGC CGC CAG GAC GCC TTG GGC GCA ATG 
ala arg arg gin asp ala leu gly ala met 
1531/511 

CCC CGT ACC CAA ATC CTG GTG CCG CCG GCG 
pro arg thr gin ile leu val pro pro ala 
1591/531 

GTC ATC CTG ACC GCG CTG GCC ACC GCC ATC 
val ile leu thr ala leu ala thr ala ile 
1651/551 

CCG GCG GTG ATC GCT GAC GCC GCG GCC CGC 
pro ala val ile ala asp ala ala ala arg 
1711/571 

AGC GCC GCT CGC GGC CGG TTC AAT GAC GAC 
ser ala ala arg gly arg phe asn asp asp 
1771/591 

CGG CTA TGG AAG CTG ACC TCG GCG TTG ACC 
arg leu trp lys leu thr ser ala leu thr 
1831/611 

CAG TAC ACC GCA CCA CTA CGC GAG GAC ATG 
gin tyr thr ala pro leu arg glu asp met 
1891/631 

GAT ACC CGC AAC GGG CTG GCC CAG CAG CGG 
asp thr arg asn gly leu ala gin gin arg 
1951/651 

GTT TTC GGC GCG GTG ACC ATC GTC AAC CCG 
leu phe gly ala val thr ile val asn pro 
2011/671 

AGT CCG CTG CCG TTG GCG CTG CAT AAT GGC 
ser pro leu pro leu ala leu his asn gly 



(continued 1) 
(continued 1) 



REPLACEMENT SHEET (RULE 26) 



115/185 



2041/681 

CTC GCC GTG CCA ATC 
leu ala val pro lie 
2101/701 

GAT GTC GGT CAG ATC 
asp val gly gin lie 
2161/721 

AAC TTC ACA CAG CGG 
asn phe thr gin arg 
2221/741 

GGT GAA CCG GTG CGG 
gly glu pro val arg 
2281/761 

ACG CTA TCC GCT GCG 
thr leu ser ala ala 
2341/781 

CGT GGC CAG CCT GAT 
arg gly gin pro asp 
2401/801 

CCG CAG CGC CGT GCC 
pro gin arg arg ala 



CGG GTC CGG CTA CAG 
arg val arg leu gin 

GAG CTA CCG CCC GGG 
glu leu pro pro gly 

GTT GCC GTC GAC GTG 
val ala val asp val 

TTG TCG GTG CAC TCC 
leu ser val his ser 

GCC GTG CTG GTA ACG 
ala val leu val thr 

CGC GCC GAC CTG GAT 
arg ala asp leu asp 

GTA GCC AGT CGG GAT 
val ala ser arg asp 



2071/691 

GTC GAT GCT CCG CCC 
val asp ala pro pro 
2131/711 

TAG CTG CCG CTA CGA 
tyr leu pro leu arg 
2191/731 

TCG CTG CGG ACC CCC 
ser leu arg thr pro 

2251/751 

AAC GCC TAC GGC AAG 

asn ala tyr gly lys 
2311/771 

CTG GCG GGC CGG CGC 
leu ala gly arg arg 
2371/791 

CGC CCC GAC CTG CCT 
arg pro asp leu pro 
2431/811 

GAC GAA AAG CAC CGG 
asp glu lys his arg 



GGG ATG ACG GTG GCC 
gly met thr val ala 

GTA CCA ATC GAG GTG 
val pro ile glu val 

GAC GGC GTC GCG CTG 
asp gly val ala leu 

GTG TTG TTC GCG ATC 
val leu phe ala ile 

CTT TGG CAC CGG TTC 
leu trp his arg phe 

ACC GGC AAA CAC GCC 
thr gly lys his ala 

GTA TGA 
val OPA 



SEQ ID 36F (continued 2) 
FIGURE 36F (continued 2) 



1/1 31/11 

ATC CGC GCG TTG GCG TCG CAT CCG AAC ATC GTC GGA GTC AAG GAC GCC AAA GCC GAC CTG 
ile arg ala leu ala ser his pro asn ile val gly val lys asp ala lys ala asp leu 
61/21 ^1/3^ 

CAC AGC GGC GCC CAA ATC ATG GCC GAC ACC GGA CTG GCC TAC TAT TCC GGC GAC GAC GCG 
his ser gly ala gin ile met ala asp thr gly leu ala tyr tyr ser gly asp asp ala 
121/41 151/51 

CTC AAC CTG CCC TGG CTG GCC ATG GGC GCC ACG GGC TTC ATC AGC GTG ATT GCC CAC CTG 
leu asn leu pro trp leu ala met gly ala thr gly phe ile ser val ile ala hxs leu 
181/61 211/71 

GCA GCC GGG CAG CTT CGA GAG TTG TTG TCC GCC TTC GGT TCT GGG GAT ATC GCC ACC GCC 

ala ala gly gin leu arg glu leu leu ser ala phe gly ser gly asp ile ala thr ala 

241/81 

CGC AAG ATC 

arg lys ile 



SEQ ID N0.37A 
FIGURE 37A 



REPLACEMENT SHEET (RULE 26) 



116/185 



1/1 

GAT CCG CGC GTT GGC GTC GCA 
asp pro arg val gly val ala 
61/21 

GCA CAG CGG CGC CCA AAT CAT 
ala gin arg arg pro asn his 
121/41 

GCT CAA CCT GCC CTG GCT GGC 
ala gin pro ala leu ala gly 
181/61 

GGC AGC CGG GCA GCT TCG AGA 
gly ser arg ala ala ser arg 
241/81 

CCG CAA GAT C 
pro gin asp 









31/11 \ 




TCC 


GAA 


CAT 


CGT CGG 


AGT 


ser 


glu 


his 


arg arg 


ser 








91/31 




GGC 


CGA 


CAC 


CGG ACT 


GGC 


gly 


arg 


his 


arg thr 


gly 








151/51 




CAT 


GGG 


CGC 


CAC GGG 


CTT 


his 


gly 


arg 


his gly 


leu 








211/71 




GTT 


GTT 


GTC 


CGC CTT 


CGG 


val 


val 


val 


arg leu 


arg 



SEQ ID No,37B 
FIGURE 37B 



CAA GGA CGC CAA AGC CGA CCT 
gin gly arg gin ser arg pro 

CTA CTA TTC CGG CGA CGA CGC 
leu leu phe arg arg arg arg 

CAT CAG CGT GAT TGC CCA CCT 
his gin arg asp cys pro pro 

TTC TGG GGA TAT CGC CAC CGC 
phe trp gly tyr arg his arg 



1/1 

TCC GCG CGT TGG CGT CGC ATC 
ser ala arg trp arg arg lie 
61/21 

ACA GCG GCG CCC AAA TCA TGG 
thr ala ala pro lys ser trp 
121/41 

TCA ACC TGC CCT GGC TGG CCA 
ser thr cys pro gly trp pro 
IBl/ei 

CAG CCG GGC AGC TTC GAG AGT 
gin pro gly ser phe glu ser 
241/81 
GCA AGA TC 
ala arg 



31/11 



CGA 


ACA 


TCG 


TCG GAG 


TCA 


arg 


thr 


ser 


ser glu 


ser 








91/31 




CCG 


ACA 


CCG 


GAC TGG 


CCT 


pro 


thr 


pro 


asp trp 


pro 








151/51 




TGG 


GCG 


CCA 


CGG GCT 


TCA 


trp 


ala 


pro 


arg ala 


ser 








211/71 




TGT 


TGT 


CCG 


CCT TCG 


GTT 


cys 


cys 


pro 


pro ser 


val 



SEQ ID N0.37C 
FIGURE 37C 



AGG ACG CCA AAG CCG ACC TGC 
arg thr pro lys pro thr cys 

ACT ATT CCG GCG ACG ACG CGC 
thr lie pro ala thr thr arg 

TCA GCG TGA TTG CCC ACC TGG 
ser ala OPA leu pro thr trp 

CTG GGG ATA TCG CCA CCG CCC 
leu gly ile ser pro pro pro 



REPLACEMENT SHEET (RULE 26) 
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Coding sequence Rv2753c predicted by Cole et al., 1998 (Nature 393 537-544) 
containing Seq 37A 

1/1 31/11 

GTG ACC ACC GTC GGA TTG GAC GTC GCA GCG CGC CTA GGA ACC CTG CTG ACC GCG ATG GTG 
val thr thr val gly phe asp val ala ala arg leu gly thr leu leu thr ala met val 
61/21 91/31 

ACA CCG TTT AGC GGC GAT GGC TCC CTG GAC ACC GCC ACC GCG GCG CGG CTG GCC AAC CAC 
thr pro phe ser gly asp gly ser leu asp thr ala thr ala ala arg leu ala asn his 
121/41 151/51 

CTG GTC GAT CAG GGG TGC GAC GGT CTG GTG GTC TCG GGC ACC ACC GGC GAG TCG CCG ACC 
leu val asp gin gly cys asp gly leu val val ser gly thr thr gly glu ser pro thr 
181/61 211/71 

ACC ACC GAC GGG GAG AAA ATC GAG CTG CTG CGG GCC GTC TTG GAA GCG GTG GGG GAC CGG 
thr thr asp gly glu lys lie glu leu leu arg ala val leu glu ala val gly asp arg 
241/81 271/91 

GCC CGT GTT ATC GCC GGT GCC GGC ACC TAT GAC ACC GCG CAC AGC ATC CGG CTG GCC AAG 
ala arg val lie ala gly ala gly thr tyr asp thr ala his ser lie arg leu ala lys 
301/101 331/111 

GCT TGT GCG GCC GAG GGT GCG CAC GGG CTG CTG GTG GTC ACG CCC TAC TAT TCC AAG CCG 
ala cys ala ala glu gly ala his gly leu leu val val thr pro tyr tyr ser lys pro 
361/121 391/131 

CCG CAG CGG GGG CTG CAA GCC CAT TTC ACC GCC GTC GCC GAC GCG ACC GAG CTG CCG ATG 
pro gin arg gly leu gin ala his phe thr ala val ala asp ala thr glu leu pro met 
421/141 451/151 

CTG CTC TAT GAC ATC CCG GGG CGG TCG GCG GTG CCG ATC GAG CCC GAC ACG ATC CGC GCG 
leu leu tyr asp ile pro gly arg ser ala val pro ile glu pro asp thr ile arg ala 
481/161 511/171 

TTG GCG TCG CAT CCG AAC ATC GTC GGA GTC AAG GAC GCC AAA GCC GAC CTG CAC AGC GGC 
leu ala ser his pro asn ile val gly val lys asp ala lys ala asp leu his ser gly 
541/181 571/191 

GCC CAA ATC ATG GCC GAC ACC GGA CTG GCC TAC TAT TCC GGC GAC GAC GCG CTC AAC CTG 
ala gin ile met ala asp thr gly leu ala tyr tyr ser gly asp asp ala leu asn leu 
601/201 631/211 

CCC TGG CTG GCC ATG GGC GCC ACG GGC TTC ATC AGC GTG ATT GCC CAC CTG GCA GCC GGG 
pro trp leu ala met gly ala thr gly phe ile ser val ile ala his leu ala ala gly 
661/221 691/231 

CAG CTT CGA GAG TTG TTG TCC GCC TTC GGT TCT GGG GAT ATC GCC ACC GCC CGC AAG ATC 
gin leu arg glu leu leu ser ala phe gly ser gly asp ile ala thr ala arg lys ile 
721/241 751/251 

AAC ATT GCG GTC GCC CCG CTG TGC AAC GCG ATG AGC CGC CTG GGT GGG GTG ACG TTG TCC 
asn ile ala val ala pro leu cys asn ala met ser arg leu gly gly val thr leu ser 
781/261 811/271 

AAG GCG GGC TTG CGG CTG CAG GGC ATC GAC GTC GGT GAT CCC CGG CTG CCC CAG GTG GCC 
lys ala gly leu arg leu gin gly ile asp val gly asp pro arg leu pro gin val ala 
841/281 871/291 

GCG ACA CCG GAG CAG ATC GAC GCG TTG GCC GCC GAC ATG CGC GCG GCC TCG GTG CTT CGG 
ala thr pro glu gin ile asp ala leu ala ala asp met arg ala ala ser val leu arg 



901/301 

TGA 

OPA 



SEQ ID N0.37D 
FIGURE 37D 



REPLACEMENT SHEET (RULE 26) 



118/185 



ORF according to Cole et al . , 1998 (Nature 393 537-544) containing Rv2753c 
1/1 31/11 



TAA GGT GAG CGC CGT GGC CGA GAC CGC GCC GCT GCG CGT GCA ACT GAT CGC CAA GAC CGA 
OCH gly glu arg arg gly arg asp arg ala ala ala arg ala thr asp arg gin asp arg 
61/21 91/31 

CTT CTT GGC GCC ACC CGA CGT GCC CTG GAC CAC CGA CGC CGA CGG CGG ACC CGC GCT GGT 
leu leu gly pro thr arg arg ala leu asp his arg arg arg arg arg thr arg ala gly 
121/41 151/51 

CGA GTT CGC CGG CCG GGC CTG CTA TCA GAG CTG GTC CAA GCC CAA TCC CAA GAC CGC CAC 
arg val arg arg pro gly leu leu ser glu leu val gin ala gin ser gin asp arg his 
181/61 211/71 

CAA CGC CGG CTA CCT CCG GCA CAT CAT CGA CGT CGG ACA TTT CTC GGT GCT AGA GCA TGC 
gin arg arg leu pro pro ala his his arg arg arg thr phe leu gly ala arg ala cys 
241/81 271/91 

CAG CGT GTC GTT CTA CAT CAC CGG GAT CTC GCG ATC GTG CAC CCA CGA GCT GAT CCG CCA 
gin arg val val leu his his arg asp leu ala ile val his pro arg ala asp pro pro 
301/101 331/111 

CCG GCA TTT CTC CTA CTC GCA GCT CTC CCA GCG CTA CGT ACC CGA GAA GGA CTC GCG GGT 
pro ala phe leu leu leu ala ala leu pro ala leu arg thr arg glu gly leu ala gly 
361/121 391/131 

CGT CGT GCC GCC CGG CAT GGA GGA CGA CGC CGA CCT GCG CCA CAT CCT GAC CGA GGC CGC 
arg arg ala ala arg his gly gly arg arg arg pro ala pro his pro asp arg gly arg 
421/141 451/151 

CGA CGC CGC CCG CGC CAC CTA CAG CGA GCT GCT GGC CAA GCT GGA AGC CAA GTT CGC CGA 
arg arg arg pro arg his leu gin arg ala ala gly gin ala gly ser gin val arg arg 
481/161 511/171 

CCA ACC CAA CGC GAT CCT GCG CCG CAA GCA GGC CCG CCA AGC CGC CCG CGC GGT GCT GCC 
pro thr gin arg asp pro ala pro gin ala gly pro pro ser arg pro arg gly ala ala 
541/181 571/191 

CAA CGC CAC CGA AAC CCG CAT CGT GGT GAC CGG CAA CTA CCG GGC CTG GCG GCA CTT CAT 
gin arg his arg asn pro his arg gly asp arg gin leu pro gly leu ala ala leu hxs 
601/201 631/211 

CGC AAT GCG GGC CAG CGA GCA CGC CGA CGT GGA AAT CCG GCG ACT GGC CAT CGA ATG CCT 
arg asn ala gly gin arg ala arg arg arg gly asn pro ala thr gly his arg met pro 
661/221 691/231 

GCG CCA GCT CGC CGC CGT GGC CCC CGC GGT GTT CGC CGA CTT CGA GGT GAC CAC CCT GGC 
ala pro ala arg arg arg gly pro arg gly val arg arg leu arg gly asp his pro gly 
721/241 751/251 

CGA CGG CAC CGA GGT GGC GAC CAG CCC GTT GGC GAC CGA AGC CTG AGG CGG CGT GTC GCT 
arg arg his arg gly gly asp gin pro val gly asp arg ser leu arg arg arg val ala 
781/261 811/271 

GGA CAA ACA CGC GCG CTC GCG GCC GGG ATA AAG CGC CAG GTA ACC TTG GGA GCC GTG ACC 
gly gin thr arg ala leu ala ala gly ile lys arg gin val thr leu gly ala val thr 
841/281 871/291 

ACC GTC GGA TTC GAC GTC GCA GCG CGC CTA GGA ACC CTG CTG ACC GCG ATG GTG ACA CCG 
thr val gly phe asp val ala ala arg leu gly thr leu leu thr ala met val thr pro 
901/301 931/311 

TTT AGC GGC GAT GGC TCC CTG GAC ACC GCC ACC GCG GCG CGG CTG GCC AAC CAC CTG GTC 
phe ser gly asp gly ser leu asp thr ala thr ala ala arg leu ala asn his leu val 
961/321 991/331 

GAT CAG GGG TGC GAC GGT CTG GTG GTC TCG GGC ACC ACC GGC GAG TCG CCG ACC ACC ACC 
asp gin gly cys asp gly leu val val ser gly thr thr gly glu ser pro thr thr thr 



SEQ ID N0.37F 
FIGURE 37F 

REPLACEMENT SHEET (RULE 26) 



119/185 



1021/341 1051/351 

GAC GGG GAG AAA ATC GAG CTG CTG CGG GCC GTC TTG GAA GCG GTG GGG GAG CGG GCC CGT 
asp gly glu lys ile glu leu leu arg ala val leu glu ala val gly asp arg ala arg 
1081/361 1111/371 

GTT ATC GCC GGT GCC GGC ACC TAT GAC ACC GCG CAC AGC ATC CGG CTG GCC AAG GCT TGT 
val ile ala gly ala gly thr tyr asp thr ala his ser ile arg leu ala lys ala cys 
1141/381 1171/391 

GCG GCC GAG GGT GCG CAC GGG CTG CTG GTG GTC ACG CCC TAC TAT TCC AAG CCG CCG CAG 
ala ala glu gly ala his gly leu leu val val thr pro tyr tyr ser lys pro pro gin 
1201/401 1231/411 

CGG GGG CTG CAA GCC CAT TTC ACC GCC GTC GCC GAC GCG ACC GAG CTG CCG ATG CTG CTC 
arg gly leu gin ala his phe thr ala val ala asp ala thr glu leu pro met leu leu 
1261/421 1291/431 

TAT GAC ATC CCG GGG CGG TCG GCG GTG CCG ATC GAG CCC GAC ACG ATC CGC GCG TTG GCG 
tyr asp ile pro gly arg ser ala val pro ile glu pro asp thr ile arg ala leu ala 
1321/441 1351/451 

TCG CAT CCG AAC ATC GTC GGA GTC AAG GAC GCC AAA GCC GAC CTG CAC AGC GGC GCC CAA 
ser his pro asn ile val gly val lys asp ala lys ala asp leu his ser gly ala gin 
1381/461 1411/471 

ATC ATG GCC GAC ACC GGA CTG GCC TAC TAT TCC GGC GAC GAC GCG CTC AAC CTG CCC TGG 
ile met ala asp thr gly leu ala tyr tyr ser gly asp asp ala leu asn leu pro trp 
1441/481 1471/491 

CTG GCC ATG GGC GCC ACG GGC TTC ATC AGC GTG ATT GCC CAC CTG GCA GCC GGG CAG CTT 
leu ala met gly ala thr gly phe ile ser val ile ala his leu ala ala gly gin leu 
1501/501 1531/511 

CGA GAG TTG TTG TCC GCC TTC GGT TCT GGG GAT ATC GCC ACC GCC CGC AAG ATC AAC ATT 
arg glu leu leu ser ala phe gly ser gly asp ile ala thr ala arg lys ile asn ile 
1561/521 1591/531 

GCG GTC GCC CCG CTG TGC AAC GCG ATG AGC CGC CTG GGT GGG GTG ACG TTG TCC AAG GCG 
ala val ala pro leu cys asn ala met ser arg leu gly gly val thr leu ser lys ala 
1621/541 1651/551 

GGC TTG CGG CTG CAG GGC ATC GAC GTC GGT GAT CCC CGG CTG CCC CAG GTG GCC GCG ACA 
gly leu arg leu gin gly ile asp val gly asp pro arg leu pro gin val ala ala thr 
1681/561 1711/571 

CCG GAG CAG ATC GAC GCG TTG GCC GCC GAC ATG CGC GCG GCC TCG GTG CTT CGG TGA 
pro glu gin ile asp ala leu ala ala asp met arg ala ala ser val leu arg OPA 



SEQ ID No,37F (continued 1) 
FIGURE 37F (continued 1) 



REPLACEMENT SHEET (RULE 26) 



120/185 



1/1 

GCG GTG AAC TGG TGG GCC CGG ATG GTT CAA 
ala val asn trp trp ala arg met val gin 
61/21 

AGA CGA CGG ATG GAA GGA GAT GCT GGC GCC 
arg arg arg met glu gly asp ala gly ala 
121/41 

TCG TCG TCT ACG GAG GTG T^AG GCG GCG GAT 
ser ser ser thr glu val lys ala ala asp 
181/61 

ACT GGC CCG CAG GTG AAG GCG GCG GAT TCG 
thr gly pro gin val lys ala ala asp ser 

241/81 

GAC GCG TGC CCA GAA CAG GCC CTC GTC GAG 
asp ala cys pro glu gin ala leu val glu 
301/101 

CTT GTT GGC ATT GCG GCG ACG CTG CTC GCG 
leu val gly ile ala ala thr leu leu ala 
361/121 

TTT GCG TTG CGC TCA CAC CAG GAA AGC CAA 
phe ala leu arg ser his gin glu ser gin 
421/141 

GAG GCC GCT AAG GAT TGC GTT GCG GCC ACG 
glu ala ala lys asp cys val ala ala thr 
481/161 

AGC ATG CAG AAG ATC 
ser met gin lys ile 



31/11 

GTA CGC CGT CGC AAA CTC GAG CAC AAC AGG 
val arg arg arg lys leu glu his asn arg 
91/31 

GGC CAG CTG AAC CCT GCC GAT GCG AAT AAG 
gly gin leu asn pro ala asp ala asn lys 
151/51 

TCG GCG GAA TCT GAC GCC GGA GCC GAC CAG 
ser ala glu ser asp ala gly ala asp gin 
211/71 

GCG GAA TCT GAC GCC GGA GAG CTC GGC GAG 
ala glu ser asp ala gly glu leu gly glu 
271/91 

CGG CGC CCG TCG CGG TTG CGG CGA GGC TGG 
arg arg pro ser arg leu arg arg gly trp 
331/111 

TTG GCC GGT GGC CTT GGC GCA GCG GGT TAT 
leu ala gly gly leu gly ala ala gly tyr 
391/131 

TCA ATC GCG CGC GAG GAC CTT GCG GCC ATT 
ser ile ala arg glu asp leu ala ala ile 
451/151 

CAG GCA CCC GAT GCT GGG GCG ATG TCG GCT 
gin ala pro asp ala gly ala met ser ala 



SEQ ID N0.38A 



FIGURE 38A 



1/1 

CAG CGG TGA ACT GGT GGG CCC GGA TGG TTC 

gin arg OPA thr gly gly pro gly trp phe 
61/21 

GGA GAC GAC GGA TGG AAG GAG ATG CTG GCG 
gly asp asp gly trp lys glu met leu ala 
121/41 

AGT CGT CGT CTA CGG AGG TGA AGG CGG CGG 
ser arg arg leu arg arg OPA arg arg arg 
181/61 

AGA CTG GCC CGC AGG TGA AGG CGG CGG ATT 
arg leu ala arg arg OPA arg arg arg ile 
241/81 

AGG ACG CGT GCC CAG AAC AGG CCC TCG TCG 
arg thr arg ala gin asn arg pro ser ser 
301/101 

GGC TTG TTG GCA TTG CGG CGA CGC TGC TCG 
gly leu leu ala leu arg arg arg cys ser 
361/121 

ATT TTG CGT TGC GCT CAC ACC AGG AAA GCC 
ile leu arg cys ala his thr arg lys ala 
421/141 

TTG AGG CCG CTA AGG ATT GCG TTG CGG CCA 
leu arg pro leu arg ile ala leu arg pro 
481/161 

CTA GCA TGC AGA AGA TC 
leu ala cys arg arg 



31/11 


















AAG TAG 


GCC 


GTC 


GCA 


AAC 


TCG 


AGC 


ACA ACA 


lys tyr 


ala 


val 


ala 


asn 


ser 


ser 


thr 


thr 


91/31 


















CCG GCC 


AGC 


TGA ACC 


CTG 


CCG 


ATG 


CGA ATA 


pro ala 


ser 


OPA 


thr 


leu 


pro 


met 


arg 


ile 


151/51 


















ATT CGG 


CGG 


AAT 


CTG 


ACG 


CCG 


GAG 


CCG 


ACC 


ile arg 


arg 


asn 


leu 


thr 


pro 


glu pro 


thr 


211/71 


















CGG CGG 


AAT 


CTG 


ACG 


CCG 


GAG 


AGC 


TCG 


GCG 


arg arg 


asn 


leu 


thr 


pro 


glu 


ser 


ser 


ala 


271/91 


















AGC GGC 


GCC 


CGT 


CGC 


GGT 


TGC 


GGC 


GAG 


GCT 


ser gly 


ala 


arg 


arg 


gly 


cys 


gly glu 


ala 


331/111 


















CGT TGG 


CCG 


GTG 


GCC 


TTG 


GCG 


CAG 


CGG 


GTT 


arg trp 


pro 


val 


ala 


leu 


ala 


gin 


arg 


val 


391/131 


















AAT CAA 


TCG 


CGC 


GCG 


AGG 


ACC 


TTG 


CGG 


CCA 


asn gin 


ser 


arg 


ala 


arg 


thr 


leu 


arg 


pro 


451/151 


















CGC AGG 


CAC 


CCG 


ATG 


CTG 


GGG 


CGA 


TGT 


CGG 


arg arg 


his 


pro 


met 


leu 


gly 


arg 


cys 


arg 



SEQ ID N0.38B 
FIGURE 38B 
REPLACEMENT SHEET (RULE 26) 



121/185 



1/1 


















31/11 


















AGC GGT 


GAA 


CTG 


GTG 


GGC 


CCG 


GAT 


GGT 


TCA 


AGT ACG 


CCG 


TCG 


CAA 


ACT 


CGA 


GCA 


CAA 


CAG 


ser gly 


glu 


leu 


val 


giy 


pro 


asp 


gly 


ser 


ser thr 


pro 


ser 


gin 


thr 


arg 


ala 


gin 


gin 


61/21 
















91/31 


















GAG ACG 


ACG 


GAT 


GGA 


AGG 


AGA 


TGC 


TGG 


CGC 


CGG CCA 


GCT 


GAA 


CCC 


TGC 


CGA 


TGC 


GAA 


TAA 


glu thr 


thr 


asp 


gly 


arg 


arg 


cys 


trp 


arg 


arg pro 


ala 


glu 


pro 


cys 


arg 


cys 


glu 


OCH 


121/41 


















151/51 


















GTC GTC 


GTC 


TAG 


GGA 


GGT 


GAA 


GGC 


GGC 


GGA 


TTC GGC 


GGA 


ATC 


TGA 


CGC 


CGG 


AGC 


CGA 


CCA 


val val 


val 


tyr 


gly 


gly 


glu 


gly 


gly gly giy giy 


ile 


OPA 


arg 


arg 


ser 


arg 


pro 


181/61 
















211/71 


















GAG TGG 


CCC 


GCA 


GGT 


GAA 


GGC 


GGC 


GGA 


TTC 


GGC GGA 


ATC 


TGA 


CGC 




AtjA. 








asp trp 


pro 


ala 


gly 


glu 


gly 


gly 


gly 


phe 


gly gly 


ile 


OPA 


arg 


arg 


arg 


-.1-1 

ala 


arg 


arg 


241/81 
















271/91 


















GGA CGC 


GTG 


CCC 


AGA 


ACA 


GGC 


CCT 


CGT 


CGA 


GCG GCG 


CCC 
















gly arg 


val 


pro 


arg 


thr 


gly 


pro 


arg 


arg 


ala ala 


pro 


val 


ala 


val 


ala 


ala 


arg 


leu 


301/101 


















331/111 


















GCT TGT 


TGG 


CAT 


TGC 


GGC 


GAC 


GCT 


GCT 


CGC 


GTT GGC 


CGG 


TGG 


CCT 


TGG 


CGC 


AGC 


GGG 


TTA 


ala cys 


trp 


his 


cys 


gly 


asp 


ala 


ala 


arg 


val gly 


arg 


trp 


pro 


trp 


arg 


ser 


gly 


leu 


361/121 
















391/131 


















TTT TGG 


GTT 


GCG 


CTC 


ACA 


CCA 


GGA 


AAG 


CCA 


ATC AAT 


CGC 


GCG 


CGA 


GGA 


CCT 


TGC 


GGC 


CAT 


phe cys 


val 


ala 


leu 


thr 


pro 


gly 


lys 


pro 


ile asn 


arg 


ala 


arg 


gly 


pro 


cys 


giy 


his 


421/141 


















451/151 
















GGC 


TGA GGC 


CGC 


TAA 


GGA 


TTG 


CGT 


TGC 


GGC 


CAC 


GCA GGC 


ACC 


CGA 


TGC 


TGG 


GGC 


GAT 


GTC 


OPA gly 


arg 


OCH 


gly 


leu 


arg 


cys 


gly 


his 


ala gly 


thr 


arg 


cys 


trp 


gly 


asp 


val 


gly 


481/161 




































TAG CAT 


GCA 


GAA 


GAT 


C 




























AMB his 


ala 


glu 


asp 































SEQ ID N0.38C 
FIGURE 38C 



REPLACEMENT SHEET (RULE 26) 



122/185 



Sequence Rv0175 predicted by Cole et al . , 1998 (Nature 393 537-544) and 
containing seq38A 

1/1 31/11 

GTG AAG GCG GCG GAT TCG GCG GAA TOT GAG GCC GGA GCC GAG GAG ACT GGC CCG CAG GTG 
val lys ala ala asp ser ala giu ser asp ala gly ala asp gin thr gly pro gin val 
61/21 91/31 

AAG GCG GCG GAT TCG GCG GAA TCT GAG GCC GGA GAG CTC GGC GAG GAG GCG TGC CCA GAA 
lys ala ala asp ser ala glu ser asp ala gly glu leu gly glu asp ala cys pro glu 
121/41 151/51 

CAG GCC CTC GTC GAG CGG CGC CCG TCG CGG TTG CGG CGA GGC TGG CTT GTT GGC ATT GCG 
gin ala leu val glu arg arg pro ser arg leu arg arg gly trp leu val gly ile ala 
181/61 211/71 

GCG ACG CTG CTC GCG TTG GCC GGT GGC CTT GGC GCA GCG GGT TAT TTT GCG TTG CGC TCA 
ala thr leu leu ala leu ala gly gly leu gly ala ala gly tyr phe ala leu arg ser 
241/81 271/91 

CAC CAG GAA AGC CAA TCA ATC GCG CGC GAG GAC CTT GCG GCC ATT GAG GCC GCT AAG GAT 
his gin glu ser gin ser ile ala arg glu asp leu ala ala ile glu ala ala lys asp 
301/101 331/111 

TGC GTT GCG GCC ACG CAG GCA CCC GAT GCT GGG GCG ATG TCG GCT AGC ATG CAG AAG ATC 
cys val ala ala thr gin ala pro asp ala gly ala met ser ala ser met gin lys ile 
361/121 391/131 

ATC GAG TGT GGC ACC GGT GAT TTC GGT GCC CAG GCG TCG TTG TAG ACC AGC ATG CTC GTC 
ile glu cys gly thr gly asp phe gly ala gin ala ser leu tyr thr ser met leu val 
421/141 451/151 

GAG GCG TAT CAA GCG GCC AGC GTC CAC GTG CAA GTG ACC GAT ATG CGC GCG GCG GTC GAG 
glu ala tyr gin ala ala ser val his val gin val thr asp met arg ala ala val glu 
481/161 511/171 

CGC AAC AAC AAT GAC GGG TCG GTC GAT GTT CTG GTG GCG CTC CGG GTC AAG GTG TGC AAC 
arg asn asn asn asp gly ser val asp val leu val ala leu arg val lys val ser asn 
541/181 571/191 

ACC GAC TCG GAT GCC CAT GAA GTC GGC TAG CGT CTT CGG GTC CGG ATG GCA CTG GAT GAG 

thr asp ser asp ala his glu val gly tyr arg leu arg val arg met ala leu asp glu 

601/201 631/211 

GGC CGC TAT AAG ATC GCC AAA CTC GAC CAG GTG ACG AAG TGA 

gly arg tyr lys ile ala lys leu asp gin val thr lys OPA 



SEQ ID No,38D 
FIGURE 38D 



REPLACEMENT SHEET (RULE 26) 



123/185 



ORF according to Cole et al . , 1998 {Nature 393 537-544) containing Rv0175 
1/1 31/11 

TGA ACT GGT GGG GCC GGA TGG TGT CAA GTA CGC CGT CGC AAA CTC GAG CAC AAC AGG AGA 
OPA thr gly gly ala gly trp cys gin val arg arg arg lys leu glu his asn arg arg 
61/21 91/31 

CGA CGG ATG GAA GGA GAT GCT GGC GCC GGC CAG CTG AAC CCT GCC GAT GCG AAT AAG TCG 
arg arg met glu gly asp ala gly ala gly gin leu asn pro ala asp ala asn lys ser 
121/41 151/51 

TCG TCT ACG GAG GTG AAG GCG GCG GAT TCG GCG GAA TCT GAC GCC GGA GCC GAC CAG ACT 
ser ser thr glu val lys ala ala asp ser ala glu ser asp ala gly ala asp gin thr 
181/61 211/71 

GGC CCG CAG GTG AAG GCG GCG GAT TCG GCG GAA TCT GAC GCC GGA GAG CTC GGC GAG GAC 
gly pro gin val lys ala ala asp ser ala glu ser asp ala gly glu leu gly glu asp 
241/81 271/91 

GCG TGC CCA GAA CAG GCC CTC GTC GAG CGG CGC CCG TCG CGG TTG CGG CGA GGC TGG CTT 
ala cys pro glu gin ala leu val glu arg arg pro ser arg leu arg arg gly trp leu 
301/101 331/111 

GTT GGC ATT GCG GCG ACG CTG CTC GCG TTG GCC GGT GGC CTT GGC GCA GCG GGT TAT TTT 
val gly ile ala ala thr leu leu ala leu ala gly gly leu gly ala ala gly tyr phe 
361/121 391/131 

GCG TTG CGC TCA CAC CAG GAA AGC CAA TCA ATC GCG CGC GAG GAC CTT GCG GCC ATT GAG 
ala leu arg ser his gin glu ser gin ser ile ala arg glu asp leu ala ala ile glu 
421/141 451/151 

GCC GCT AAG GAT TGC GTT GCG GCC ACG CAG GCA CCC GAT GCT GGG GCG ATG TCG GCT AGC 
ala ala lys asp cys val ala ala thr gin ala pro asp ala gly ala met ser ala ser 
481/161 511/171 

ATG CAG AAG ATC ATC GAG TGT GGC ACC GGT GAT TTC GGT GCC CAG GCG TCG TTG TAC ACC 
met gin lys ile ile glu cys gly thr gly asp phe gly ala gin ala ser leu tyr thr 
541/181 571/191 

AGC ATG CTC GTC GAG GCG TAT CAA GCG GCC AGC GTC CAC GTG CAA GTG ACC GAT ATG CGC 
ser met leu val glu ala tyr gin ala ala ser val his val gin val thr asp met arg 
601/201 631/211 

GCG GCG GTC GAG CGC AAC AAC AAT GAC GGG TCG GTC GAT GTT CTG GTG GCG CTC CGG GTC 
ala ala val glu arg asn asn asn asp gly ser val asp val leu val ala leu arg val 
661/221 691/231 

AAG GTG TCC AAC ACC GAC TCG GAT GCC CAT GAA GTC GGC TAC CGT CTT CGG GTC CGG ATG 
lys val ser asn thr asp ser asp ala his glu val gly tyr arg leu arg val arg met 
721/241 751/251 

GCA CTG GAT GAG GGC CGC TAT AAG ATC GCC AAA CTC GAC CAG GTG ACG AAG TGA 
ala leu asp glu gly arg tyr lys ile ala lys leu asp gin val thr lys OPA 



SEQ ID N0.38F 
FIGURE 38F 



REPLACEMENT SHEET (RULE 26) 



124/185 



1/1 

ACA CCT CCC CCC CCG CCG CCG 
thr pro pro pro pro pro pro 
61/21 

GGC GTG ATG CAA GGC TGC CTT 
gly val met gin gly cys leu 
121/41 

ACC GCA CTG GTC GGC GAG GGC 
thr ala leu val ala glu arg 



31/11 

CTG CCG CCG GTT CCC TTT 
leu pro pro val pro phe 
91/31 

GAG AGC ACC AGC GGC TTG 
glu ser thr ser gly leu 
151/51 

ATC ACC GGT GCC GTC GAG 
lie thr gly ala val glu 



SEQ ID N0.39A 
FIGURE 39A 



CCC AAG GAA TGT CCG GCG CCG 
pro lys glu cys pro ala pro 

ATC ATG GGC ATC GAC AGC AAG 
lie met gly lie asp ser lys 

GAG ATC 
glu lie 



1/1 

CAC CTC CCC CCC CGC 

his leu pro pro arg 
61/21 

GCG TGA TGC PiAG GCT 

ala OPA cys lys ala 
121/41 

CCG CAC TGG TCG CCG 

pro his trp ser pro 



CGC CGC TGC CGC CGG 
arg arg cys arg arg 

GCC TTG AGA GCA CCA 
ala leu arg ala pro 

AGC GCA TCA CCG GTG 
ser ala ser pro val 



31/11 

TTC CCT TTC CCA AGG 
phe pro phe pro arg 
91/31 

GCG GCT TGA TCA TGG 
ala ala OPA ser trp 
151/51 

CCG TCG AGG AGA TC 
pro ser arg arg 



AAT GTC CGG CGC CGG 

asn val arg arg arg 

GCA TCG ACA GCA AGA 

ala ser thr ala arg 



SEQ ID N0.39B 



FIGURE 39B 



1/1 

GAC ACC TCC CCC CCC 
asp thr ser pro pro 
61/21 

GGG CGT GAT GCA AGG 
gly arg asp ala arg 
121/41 

GAC CGC ACT GGT CGC 
asp arg thr gly arg 



GCC GCC GCT GCC GCC 
ala ala ala ala ala 

CTG CCT TGA GAG CAC 
leu pro OPA glu his 

CGA GCG CAT CAC CGG 
arg ala his his arg 



31/11 

GGT TCC CTT TCC CAA 
gly ser leu ser gin 
91/31 

CAG CGG CTT GAT CAT 
gin arg leu asp his 
151/51 

TGC CGT CGA GGA GAT 
cys arg arg gly asp 



GGA ATG TCC GGC GCC 
gly met ser gly ala 

GGG CAT CGA CAG CAA 
gly his arg gin gin 

C 



SEQ ID N0.39C 



FIGURE 39C 



REPLACEMENT SHEET (RULE 26) 



125/185 



Coding sequence Rv3006 predxcted by Cole et al,, 1998 (Nature 393 537-544) and containing 
3eq3 9A 



1/1 31/11 





ATG TGG 


ACA 


ACG 


CGG 


TTG 


GTT 


CGA 


TCC 


GGA 


CTC GCC 


GCG 


CTG 


TGC 


GCG 


GCA 


GTG 


CTG 


GTA 




wee cjrp 


tnr 


unr 


arg 


leu 


val 


arg 


ser 


gly 


leu ala 


ala 


leu 


cys 


ala 


ala 


val 


leu 


val 






















y 1/ o 1 




















ICCj agc 


GGC 


TGC 


GCA 


CGCj 


TTC 


AAC 


GAC 


GCT 


CAA iCx 


CAG 


CCG 


i i C 


ACC 


ACC 


GAA 


CCG 


GAG 




O ^ e ^ V* 

ser sej. 


gly 


cys 


aia 


arg 


phe 


asn 


asp 


a1 a 

ai a 


gin ser 


gin 


pro 


phe 


tnr 


rnr 


glu 


pro 


glu 




191 / 4 1 


















XO X / O 1 


































CL. i 


err* 


CCC CCC 


CCC 


CCC 




CCC 




IjtX X 


CCC 


TTT 




leu arg 


pro 


gin 


pro 


ser 


ser 


thr 


pro 


pro 


pro pro 


pro 


pro 


leu 


pro 


pro 


val 


pro 


phe 










































CCC AAG 


GAA 


TGT 


CCG 


GCG 


CCG 


GGC 


GTG 


ATG 


CAA GGC 


TGC 


CTT 


GAG 


AGC 


ACC 


AGC 


GGC 


TTG 




pro lys 


glu 


cys 


pro 


aia 


pro 


gly 


vai 


met 


/*^1 r 

gm giy 


cys 


leu 


glu 


ser 


tnr 


ser 


gly 


leu 




O /I t / <5 1 

Z-^ 1/ o 1 


















OTI /Q1 




















ATL. Aitj 


GGC 


ATC 


GAC 


AtyC 


AAtjr 








C*PC cicc 


CTkC 


CCC 


AxL. 


ACL. 




CCC 


GTC 


GAG 




i 1 e me t 




lie 


asp 


ser 


lys 


unr 


a1 a 

aia 


leu 


val ala 


glu 


arg 


lie 


tnr 


gly 


a1 a 

aia 


vai 


glu 




-3 n 1 / 1 n 1 


















T /111 

1/111 


















= J 


tjACj Ai O 


TCT 


ATC 


AGC 


GCC 


GAG 


CCG 


7\ 7\ /*• 

AA.O- 


CjX A 


7\ 7i r** 7\ cc 
AAtj Av^tj 




AiU 


ccr* 


GTG 


GAT 


CCT 


GCC 


GGT 




giu lie 


ser 


lie 


ser 


al ct. 


glu 


pro 


lys 


val 


X y o uiix 


val 


IX^ 


pro 


\7a1 
Vdl 


asp 


pro 


a1 a 
dia 


gly 




J o 1/ xz X 


















J J X / X o X 




































X Vj 














t-XVor 


ATT" 
>VX kjr 


Tli C 




asp gly 


gly 


leu 


itie t 


asp 


lltr 


vai 


leu 


ser 


piO LXIl 


tyr 


ser 


gin 


asp 


arg 


leu 


iciet 


tyr 


Ul 


4Z 1/ Lh 1 


















flO 1/ lO 1 


















i i 


GCC TAC 


ATC 


AGC 


ACG 


CCC 


ACC 


GAC 


AAC 


CGG 




CGA 


GTG 


GCC 


GAC 


GGC 


GAC 


ATC 


CCC 


'i "]: 


ala tyr 
481/ 161 


lie 


ser 


cnr 


pro 


unr 


asp 


asn 


arg 


val val 

O 1 1/ 1 / I 


arg 


xra 1 

vai 


a1 a 

aia 


asp 


gly 


asp 


lie 


pro 




AA(j GAC 


ATC 


CTG 


ACC 


GGC 


ATC 


CCC 


AAA 


GGT 




GGT 


AAC 


ACC 


GGG 


GCG 


CTG 


ATC 


TTC 




lys asp 


-i 1 *a 

lie 


leu 


tnr 


gly 


lie 


pro 


lys 


gly 


ala ala 


gly 


asn 


unr 


gly 


al a 

aia 


leu 


lie 


phe 




o4 1/ iol 


















D i xf ly 1 




















ACC ACri 


CCC 


ACC 


ACG 


CTG 


GTC 


GTG 


ATG 


ACC 




GCT 


GGC 


GAC 


CCG 


GCG 


TTG 


GCC 


GCC 




thr ser 


pro 


tnr 


thr 


leu 


vai 


val 


met 


thr 


gly asp 


aia 


gly 


asp 


pro 


ala 


leu 


ala 


ala 




dU ±/ ZU i 


















DO 1/ Z 11 


















tT-.. 


GAT CCC 


CAA 


iCG 


TTG 






AAG 


GTC 


CTG 


'^rp 71 rp/-> 

UvjI AiC 


CaAA 


CAG 


CCC 


ACC 


ACC 


ATC 


GGC 


CAG 




asp pro 


gin 


ser 


leu 


aia 


gly 


lys 


vai 


leu 


arg ile 


glu 


gin 


pro 


UXiXT 


1-Vt r- 


-i 1 0. 

lie 


gly 


gin 




bo 1/ zZ 1 


















by 1/ z Ji 




















ACCj CCC? 












lU J. 


tin/'* 


Ax U 


fine n^cc 








1 XVj 


XljC 


7\ rp 


/—TV T" 

<oAX 


cr^c 




thr pro 


pro 


unr 


cnr 


aia 


leu 


ser 


gly 


lie 


gly ser 


gly 


gly 


gly 


leu 


cys 


-i 1 

lie 


asp 


pro 




721/241 


















751/251 




















GTC GAC 


GGC 


loo- 




rp 7\ en 
iAl 










ACG CCA 


ACG 


rice 




L-oA 


X i 


CT^C 
v.-A*jr 


cclc 


AX L 




val asp 


gly 


ser 


leu 


tyr 


va.1 


aia 


asp 


arg 


thr pro 


thr 


a1 a 

aia 


asp 


arg 


leu 


gin 


arg 


11^ 




781/261 


















811/271 




















ACC AAG 


AAC 








iv- i 


TV (— ' 






ACC TGG 


CCG 


CH^C 
uiA(— 


AAo- 


CCC 




varX Vj 


CCC 


die 




thr lys 


asn 


ser 


glu 


val 


ser 


thr 


val 


trp 


thr trp 


pro 


asp 




pro 


rri V 


val 


ala 


yxy 




841/281 


















871/291 




















TGT GCC 


GCG 


ATG 


GAC 


GGC 


ACC 


GTG 


CTG 


GTC 


AAC CTG 


ATT 


AAT 


ACC 


AAA 


CTG 


ACG 


GTG 


GCG 




cys ala 


ala 


met 


asp 


gly 


thr 


val 


leu 


val 


asn leu 


ile 


asn 


thr 


lys 


leu 


thr 


vai 


ala 




901/301 


















931/311 




















GTC CGG 


CTC 


GCG 


CCG 


TCG 


ACC 


GGT 


GCG 


GTC 


ACC GGA 


GAA 


CCC 


GAC 


GTT 


GTC 


CGC 


AAA 


GAC 




val arg 


leu 


ala 


pro 


ser 


thr 


gly 


ala 


val 


thr gly 


glu 


pro 


asp 


val 


val 


arg 


lys 


asp 




961/321 


















991/331 




















ACT CAT 


GCG 


CAT 


GCG 


TGG 


GCA 


TTA 


CGG 


ATG 


TCG CCG 


GAC 


GGC 


AAC 


GTC 


TGG 


GGA 


GCC 


ACC 




thr his 


ala 


his 


ala 


trp 


ala 


leu 


arg 


met 


ser pro 


asp 


gly 


asn 


val 


trp 


gly 


ala 


thr 




1021/341 
















1051/351 


















GTC AAC 


AAG 


ACC 


GCC 


GGC 


GAC 


GCC 


GAG 


AAG 


CTC GAC 


GAT 


GTG 


GTG 


TTC 


CCG 


CTG 


TTC 


CCG 




val asn 


lys 


thr 


ala 


gly 


asp 


ala 


glu 


lys 


leu asp 


asp 


val 


val 


phe 


pro 


leu 


phe 


pro 




1081/361 
















1111/371 


















CAG GGT 


GGC 


GGC 


TTC 


CCG 


CGC 


AAC 


AAC 


GAC 


GAC AAG 


ACC 


TGA 
















gin gly gly 


gly 


phe 


pro 


arg 


asn 


asn 


asp 


asp lys 


thr 


OPA 















SEQ ID No^39D 
FIGURE 39D 



REPLACEMENT SHEET (RULE 26) 

|ii||| l| P|l| | l | I ^ P in i l|il ^l|| p ||| ||; B|l| j|^il|^^ II P , 1 , 11 ^ 11, 



126/185 



ORF according to Cole et al . , 1998 (Nature 393 537-544) and containing Rv3006 
1/1 31/11 

TAA GGC CAT TTA GTG CCG AAT TGG GGA TTT GAG CGG CGC TTT CGC GAG ACA ATC CGC ACA 
OCH gly his leu val pro asn trp gly phe glu arg arg phe arg gin thr ile arg thr 
61/21 91/31 

TTG ACC CTG ACC AGO CCA CCA AAA GGC CCC AAT TGG GCC GCC ATG CCG ACA GTG CGC ACC 
leu thr leu thr ser pro pro lys gly pro asn trp ala ala met pro thr val arg thr 
121/41 151/51 

CCG GCA GGT GGC GGC GAT GCC CAC AAT GTC CGT AGC CTG TCG GTC ATG TGG ACA ACG CGG 
pro ala gly gly gly asp ala his asn val arg ser leu ser val met trp thr thr arg 
181/61 211/71 

TTG GTT CGA TCC GGA CTC GCC GCG CTG TGC GCG GCA GTG CTG GTA TCG AGC GGC TGC GCA 
leu val arg ser gly leu ala ala leu cys ala ala val leu val ser ser gly cys ala 
241/81 271/91 

CGG TTC AAC GAC GCT CAA TCT CAG CCG TTC ACC ACC GAA CCG GAG CTG CGG CCC CAA CCC 
arg phe asn asp ala gin ser gin pro phe thr thr glu pro glu leu arg pro gin pro 
301/101 331/111 

AGC TCG ACA CCT CCC CCC CCG CCG CCG CTG CCG CCG GTT CCC TTT CCC AAG GAA TGT CCG 
ser ser thr pro pro pro pro pro pro leu pro pro val pro phe pro lys glu cys pro 
361/121 391/131 

GCG CCG GGC GTG ATG CAA GGC TGC CTT GAG AGC ACC AGC GGC TTG ATC ATG GGC ATC GAC 
ala pro gly val met gin gly cys leu glu ser thr ser gly leu ile met gly ile asp 
421/141 451/151 

AGC AAG ACC GCA CTG GTC GCC GAG CGC ATC ACC GGT GCC GTC GAG GAG ATC TCT ATC AGC 
ser lys thr ala leu val ala glu arg ile thr gly ala val glu glu ile ser ile ser 
481/161 511/171 

GCC GAG CCG AAG GTA AAG ACG GTC ATC CCC GTG GAT CCT GCC GGT GAC GGT GGC TTG ATG 
ala glu pro lys val lys thr val ile pro val asp pro ala gly asp gly gly leu met 
541/181 571/191 

GAC ATT GTG CTG TCG CCC ACC TAC TCG CAA GAC CGG CTG ATG TAC GCC TAC ATC AGC ACG 
asp ile val leu ser pro thr tyr ser gin asp arg leu met tyr ala tyr ile ser thr 
601/201 631/211 

CCC ACC GAC AAC CGG GTG GTG CGA GTG GCC GAC GGC GAC ATC CCC AAG GAC ATC CTG ACC 
pro thr asp asn arg val val arg val ala asp gly asp ile pro lys asp ile leu thr 
661/221 691/231 

GGC ATC CCC A7^ GGT GCT GCC GGT AAC ACC GGG GCG CTG ATC TTC ACC AGT CCC ACC ACG 
gly ile pro lys gly ala ala gly asn thr gly ala leu ile phe thr ser pro thr thr 
721/241 751/251 

CTG GTC GTG ATG ACC GGG GAT GCT GGC GAC CCG GCG TTG GCC GCC GAT CCC CAA TCG TTG 
leu val val met thr gly asp ala gly asp pro ala leu ala ala asp pro gin ser leu 
781/261 811/271 

GCC GGT AAG GTC CTG CGT ATC GAA CAG CCC ACC ACC ATC GGC CAG ACG CCG CCG ACG ACG 
ala gly lys val leu arg ile glu gin pro thr thr ile gly gin thr pro pro thr thr 
841/281 871/291 

GCG CTG TCT GGC ATC GGC TCC GGC GGC GGC TTG TGC ATC GAT CCG GTC GAC GGC TCG CTA 
ala leu ser gly ile gly ser gly gly gly leu cys ile asp pro val asp gly ser leu 
901/301 931/311 

TAT GTC GCC GAC CGC ACG CCA ACG GCG GAC CGA TTG CAG CGC ATC ACC AAG AAC TCG GAG 
tyr val ala asp arg thr pro thr ala asp arg leu gin arg ile thr lys asn ser glu 



SEQ ID No,39F 
FIGURE 39F 



REPLACEMENT SHEET (RULE 26) 



127/185 



961/321 
















991/331 
















GTC TCT ACG 


GTA 


TGG 


ACC 


TGG 


CCG 


GAC 


AAG 


CCC GGC GTG 


GCC 


GGG 


TGT 


GCC 


GCG 


ATG 


GAC 


val ser thr 


val 


trp 


thr 


trp 


pro 


asp 


lys 


pro gly val 


ala 


gly 


cys 


ala 


ala 


met 


asp 


1021/341 














1051/351 
















GGC ACC GTG 


CTG 


GTC 


AAC 


CTG 


ATT 


AAT 


ACC 


AAA CTG ACG 






(jX 






GCG 


CCG 


gly thr val 


leu 


val 


asn 


leu 


ile 


asn 


thr 


lys leu thr 


val 


ala 


va± 


arg 


leu 


ala 


pro 


1081/361 
















1111/371 














GCG 


TCG ACC GGT 


GCG 


GTC 


ACC 


GGA 


GAA 


CCC 


GAC 


GTT GTC CGC 


AAA 


GAC 


ACT 


CAT 


GCG 


CAT 


ser thr gly 


ala 


val 


thr 


gly 


glu pro 


asp 


val val arg 


lys 


asp 


thr 


his 


ala 


his 


ala 


1141/381 
















1171/391 
















TGG GCA TTA 


CGG 


ATG 


TCG 


CCG 


GAC 


GGC 


AAC 


GTC TGG GGA 


GCC 


ACC 


GTC 


AAC 


AAG 


ACC 


GCC 


trp ala leu 


arg 


met 


ser 


pro 


asp 


gly 


asn 


val trp gly 


ala 


thr 


val 


asn 


lys 


thr 


ala 


1201/401 














1231/411 
















GGC GAC GCC 


GAG 


AAG 


CTC 


GAC 


GAT 


GTG 


GTG 


TTC CCG CTG 


TTC 


CCG 


CAG 


GGT 


GGC 


GGC 


TTC 


gly asp ala 


glu 


lys 


leu 


asp 


asp 


val 


val 


phe pro leu 


phe 


pro 


gin 


gly 


gly 


gly phe 


1261/421 
































CCG CGC AAC 


AAC 


GAC 


GAC 


AAG 


ACC 


TGA 




















pro arg asn 


asn 


asp 


asp 


lys 


thr 


OPA 





















SEQ ID No,39F (continued) 
FIGURE 39F (continued) 



1/1 

GAA GGC CTT GTT 
glu gly leu val 
61/21 

TCG GTT GAA CGT 
ser val glu arg 
121/41 

CGT CTC GGC GCT 
arg leu gly ala 
181/61 

GTG GCT TGA GGG 
val ala OPA gly 
241/81 

GAT CGA TGC CGC 
asp arg cys arg 
301/101 

CAG GTG GTC GTC 
gin val val val 



GAG CCG GCG CAC 
glu pro ala his 

GTA TGT GCC CGA 
val cys ala arg 

GAC TCA GGC CGC 
asp ser gly arg 

GTT GGA ACC CAG 
val gly thr gin 

TCG CGA TGA GTT 
ser arg OPA val 

GAC GCG AGT GCC 
asp ala ser ala 







31/11 


GAA 


AAC 


GAT CGT 


glu 


asn 


asp arg 






91/31 


CGA 


ATT 


GGC GGA 


arg 


ile 


gly gly 






151/51 


GAT 


CAG 


TGC CGA 


asp 


gin 


cys arg 






211/71 


AAG 


CAC 


CGG CGC 


lys 


his 


arg arg 






271/91 


CGA 


AGC 


GTG AGA 


arg 


ser 


val arg 






331/111 


ATG 


GTG 


GAT C 


met 


val 


asp 



TGT GTG TAC ATT 
cys val tyr ile 

GCG CGC CAG GGC 
ala arg gin gly 

GTT GGA GAA CTC 
val gly glu leu 

TCG GCA TGA TGA 
ser ala OPA OPA 

GCA TCG CCC ACT 
ala ser pro thr 



GGT GTG TAT GGC 
gly val tyr gly 

GCG GGG CTT GAA 
ala gly leu glu 

CGC AAC CGA TGC 
arg asn arg cys 

CGT GCT GGG TGC 
arg ala gly cys 

TCG CCG CCG GAG 
ser pro pro glu 



SEQ ID No,40A 
FIGURE 4 OA 



REPLACEMENT SHEET (RULE 26) 



128/185 



1/1 

AAG GCC TTG TTG 
lys ala leu leu 
61/21 

CGG TTG AAC GTG 
arg leu asn val 
121/41 

GTC TCG GCG CTG 
val ser ala leu 
181/61 

TGG CTT GAG GGG 
trp leu glu gly 
241/81 

ATC GAT GCC GCT 
ile asp ala ala 
301/101 

AGG TGG TCG TCG 
arg trp ser ser 



AGC CGG CGC ACG 
ser arg arg thr 

TAT GTG CCC GAC 
tyr val pro asp 

ACT CAG GCC GCG 
thr gin ala ala 

TTG GAA CCC AGA 
leu glu pro arg 

CGC GAT GAG TTC 
arg asp glu phe 

ACG CGA GTG CCA 
thr arg val pro 



31/11 



AAA 


ACG 


ATC GTT 


lys 


thr 


ile val 






91/31 


GAA 


TTG 


GCG GAG 


glu 


leu 


ala glu 






151/51 


ATC 


AGT 


GCC GAG 


ile 


ser 


ala glu 






211/71 


AGC 


ACC 


GGC GCT 


ser 


thr 


gly ala 






271/91 


GAA 


GCG 


TGA GAG 


glu 


ala 


OPA glu 






331/111 


TGG 


TGG 


ATC 


trp 


trp 


ile 



SEQ ID No. 4 OB 
FIGURE 40B 



GTG TGT ACA TTG 
val cys thr leu 

CGC GCC AGG GCG 
arg ala arg ala 

TTG GAG AAC TCC 
leu glu asn ser 

CGG CAT GAT GAC 
arg his asp asp 

CAT CGC CCA CTT 
is arg pro leu 



GTG TGT ATG GCT 
val cys met ala 

CGG GGC TTG AAC 
arg gly leu asn 

GCA ACC GAT GCG 
ala thr asp ala 

GTG CTG GGT GCG 
val leu gly ala 

CGC CGC CGG AGC 
arg arg arg ser 



1/1 

AGG CCT TGT TGA GCC 
arg pro cys OPA ala 
61/21 

GGT TGA ACG TGT ATG 
gly OPA thr cys met 
121/41 

TCT CGG CGC TGA CTC 
ser arg arg OPA leu 
181/61 

GGC TTG AGG GGT TGG 
gly leu arg gly trp 
241/81 

TCG ATG CCG CTC GCG 
ser met pro leu ala 
301/101 

GGT GGT CGT CGA CGC 
gly gly arg arg arg 



GGC GCA CGA AAA CGA 
gly ala arg lys arg 

TGC CCG ACG AAT TGG 
cys pro thr asn trp 

AGG CCG CGA TCA GTG 
arg pro arg ser val 

AAC CCA GAA GCA CCG 
asn pro glu ala pro 

ATG AGT TCG AAG CGT 
met ser ser lys arg 

GAG TGC CAT GGT GGA 
glu cys his gly gly 



31/11 

TCG TTG TGT GTA CAT 
ser leu cys val his 
91/31 

CGG AGC GCG CCA GGG 
arg ser ala pro gly 
151/51 

CCG AGT TGG AGA ACT 
pro ser trp arg thr 
211/71 

GCG CTC GGC ATG ATG 
ala leu gly met met 
271/91 

GAG AGC ATC GCC CAC 
glu ser ile ala his 

TC 



TGG TGT GTA TGG CTC 
trp cys val trp leu 

CGC GGG GCT TGA ACG 
arg gly ala OPA thr 

CCG CAA CCG ATG CGT 
pro gin pro met arg 

ACG TGC TGG GTG CGA 
thr cys trp val arg 

TTC GCC GCC GGA GCA 
phe ala ala gly ala 



SEQ ID No. 4 DC 
FIGURE 40C 



REPLACEMENT SHEET (RULE 26) 



129/185 



Coding sequence Rv0549c predicted by Cole et al., 1998 (Nature 393:537-544) and 
containing seq4 0A 

31/11 

gtg aga gca teg ccc act teg ccg ccg gag cag gtg gtc gte gac gcg agt gcc atg gtg 
val arg ala ser pro thr ser pro pro glu gin val val val asp ala ser ala met val 
61/21 91/31 

gat eta ctg get cgc act age gat egg tgc tet gcg gtg cgc gcg egg ctg get egg ace 
asp leu leu ala arg thr ser asp arg eys ser ala val arg ala arg leu ala arg thr 
121/41 151/51 

gcg atg eac gcg ccg gcg cac ttc gat gca gag gtg ttg teg gcg ctg ggg cgc atg cag 
ala met his ala pro ala his phe asp ala glu val leu ser ala leu gly arg met gin 
181/61 211/71 

cgc gcc ggc gca etc acc gtt gcc tat gtc gat gcg gca ctg gag gag ttg cga cag gtg 
arq ala gly ala leu thr val ala tyr val asp ala ala leu glu glu leu arg gin val 
241/81 271/91 

ccg gtg act cga eac ggt ett teg teg ctg ctt get gga gcg tgg teg cgc cgc gac acc 
pro val thr arg his gly leu ser ser leu leu ala gly ala trp ser arg arg asp thr 
301/101 331/111 

etc cgc ctg acc gat gcc etc tac gtc gag ctg gcc gaa acg gca ggt ctg gtg ttg ttg 
leu arg leu thr asp ala leu tyr val glu leu ala glu thr ala gly leu val leu leu 
361/121 391/131 

acc acc gac gaa aga ttg gca cgc gee tgg ccc teg get cac gee ate ggc tga 
thr thr asp glu arg leu ala arg ala trp pro ser ala his ala lie gly OPA 

SEQ ID No,40D 
FIGURE 40D 

ORF according to Cole et al . , 1998 (Nature 393:537-544) and containing Rv0549c 
1/1 



61/21 
agt gcc 
ser ala 
121/41 
ctg get 
leu ala 
181/61 

ggg cgc 

gly arg 
241/81 
ttg cga 
leu arg 
301/101 
cgc cgc 
arg arg 
361/121 



421/141 
ggc tga 
gly OPA 



















31/11 














cga 
arg 


age 
ser 


gtg 
val 


aga 
arg 


gca 
ala 


teg 
ser 


ccc 
pro 


act 
thr 


teg ccg 
ser pro 
91/31 


ccg 
pro 


gag 
glu 


cag 
gin 


gtg 
val 


gtc 
val 


gtc 
val 


atg 
met 


gtg 
val 


gat 
asp 


eta 
leu 


ctg 
leu 


get 
ala 


cgc 
arg 


act 
thr 


age gat 
ser asp 
151/51 


egg 
arg 


tgc 
eys 


tet 
ser 


gcg 
ala 


gtg 
val 


cgc 
arg 


egg 
arg 


acc 
thr 


gcg 
ala 


atg 
met 


cac 
his 


gcg 
ala 


ccg 
pro 


gcg 
ala 


cac ttc 
his phe 
211/71 


gat 
asp 


gca 
ala 


gag 
glu 


gtg 
val 


ttg 
leu 


teg 
ser 


atg 
met 


cag 
gin 


cgc 
arg 


gcc 
ala 


ggc 
gly 


gca 
ala 


etc 
leu 


ace 
thr 


gtt gee 
val ala 
271/91 


tat 
tyr 


gte 
val 


gat 
asp 


gcg 
ala 


gca 
ala 


ctg 
leu 


cag 
gin 


gtg 
val 


ccg 
pro 


gtg 
val 


act 
thr 


cga 
arg 


cac 
his 


ggt 
gly 


ett teg 
leu ser 
331/111 


teg 
ser 


ctg 
leu 


ctt 
leu 


get 
ala 


gga 
gly 


gcg 
ala 


gac 
asp 


acc 
thr 


etc 
leu 


cgc 
arg 


ctg 
leu 


acc 
thr 


gat 
asp 


gcc 
ala 


etc tac 
leu tyr 
391/131 


gtc 
val 


gag 
glu 


ctg 
leu 


gcc 
ala 


gaa 
glu 


acg 
thr 


ttg 
leu 


ttg 
leu 


acc 
thr 


acc 
thr 


gac 
asp 


gaa 
glu 


aga 
arg 


ttg 
leu 


gca cgc 
ala arg 


gcc 
ala 


tgg 
trp 


ccc 
pro 


teg 
ser 


get 
ala 


cac 
his 



egg 



SEQ ID No,40F 



FIGURE 40F 
REPLACEMENT SHEET (RULE 26) 



130/185 



1/1 

CCT GGC CGG GAC GCC TAC GTG 
pro gly arg asp ala tyr val 
61/21 

AGC GCC AAA ACG ATC AGC CCT 
ser ala lys thr lie ser pro 
121/41 

CGT CGT GAC CGT GCT GTC ACC 
arg arg asp arg ala val thr 
181/61 

GGT GTG TTG CGG AGG AGG TGT 
gly val leu arg arg arg cys 
241/81 

GCC TTG CGG GAC TGG GCA CAC 
ala leu arg asp trp ala his 









31/11 




TAG 


CCC 


GCG 


GCT AGC 


ACA 


AMB 


pro 


ala 


ala ser 


thr 








91/31 




TCG 


CGG 


ACA 


TGT CAG 


CAC 


ser 


arg 


thr 


cys gin 


his 








151/51 




ACG 


TCT 


GGT 


TAG GCT 


CGG 


thr 


ser 


gly 


AMB ala 


arg 








211/71 




GTT 


GTA 


GTG 


GGG ACG 


GCG 


val 


val 


val 


gly thr 


ala 








271/91 




GCC 


GTC 


GTC 


AGC GAT 


C 


ala 


val 


val 


ser asp 





SEQ ID No,41A 
FIGURE 41A 



GGA TAG CCA TTG TTG TGC GGT 
gly AMB pro leu leu cys gly 

CCG CCT TGG CCG GGA GAG CGG 
pro pro trp pro gly glu arg 

GGC GCG GGC TGG CGC GGA GGA 
gly ala gly trp arg gly gly 

GAT CGG CCG TTG GAC GCC TCG 
asp arg pro leu asp ala ser 



1/1 

CTG GCC GGG ACG CCT 
leu ala gly thr pro 

61/21 

GCG CCA AAA CGA TCA 
ala pro lys arg ser 
121/41 

GTC GTG ACC GTG CTG 
val val thr val leu 
181/61 

GTG TGT TGC GGA GGA 
val cys cys gly gly 
241/81 

CCT TGC GGG ACT GGG 
pro cys gly thr gly 



ACG TGT AGC CCG CGG 
thr cys ser pro arg 

GCC CTT CGC GGA CAT 
ala leu arg gly his 

TCA CCA CGT CTG GTT 
ser pro arg leu val 

GGT GTG TTG TAG TGG 
gly val leu AMB trp 

CAC ACG CCG TCG TCA 
his thr pro ser ser 



31/11 

CTA GCA CAG GAT AGC 
leu ala gin asp ser 
91/31 

GTC AGC ACC CGC CTT 
val ser thr arg leu 
151/51 

AGG CTC GGG GCG CGG 
arg leu gly ala arg 
211/71 

GGA CGG CGG ATC GGC 
gly arg arg ile gly 
271/91 
GCG ATC 
ala ile 



CAT TGT TGT GCG GTA 
his cys cys ala val 

GGC CGG GAG AGC GGC 
gly arg glu ser gly 

GCT GGC GCG GAG GAG 
ala gly ala glu glu 

CGT TGG ACG CCT CGG 
arg trp thr pro arg 



SEQ ID No. 4 IB 
FIGURE 4 IB 



REPLACEMENT SHEET (RULE 26) 



131/185 



1/1 

TGG CCG GGA CGC CTA CGT GTA 
trp pro gly arg leu arg val 
61/21 

CGC CAA AAC GAT CAG CCC TTC 
arg gin asn asp gin pro phe 
121/41 

TCG TGA CCG TGC TGT CAC CAC 
ser OPA pro cys cys his his 
181/61 

TGT GTT GCG GAG GAG GTG TGT 
cys val ala glu glu val cys 
241/81 

CTT GCG GGA CTG GGC ACA CGC 
leu ala gly leu gly thr arg 









31/11 




GCC 


CGC 


GGC 


TAG CAC 


AGG 


ala 


arg 


gly 


AMB his 


arg 








91/31 




GCG 


GAC 


ATG 


TCA GCA 


CCC 


ala 


asp 


met 


ser ala 


pro 








151/51 




GTC 


TGG 


TTA 


GGC TCG 


GGG 


val 


trp 


leu 


gly ser 


gly 








211/71 




TGT 


AGT 


GGG 


GAC GGC 


GGA 


cys 


ser 


gly 


asp gly 


gly 








271/91 




CGT 


CGT 


CAG 


CGA TC 




arg 


arg 


gin 


arg 





SEQ ID No,41C 
FIGURE 41C 



ATA GCC ATT GTT GTG CGG TAG 
ile ala ile val val arg AMB 

GCC TTG GCC GGG AGA GCG GCG 
ala leu ala gly arg ala ala 

CGC GGG CTG GCG CGG AGG AGG 
arg gly leu ala arg arg arg 

TCG GCC GTT GGA CGC CTC GGC 
ser ala val gly arg leu gly 



Coding sequence Rv2975c predicted by 
containing seq41A 



1/1 



gtg ggg 


acg 


gcg 


gat 


egg 


ccg 


ttg 


gac 


gcc 


val gly 


thr 


ala 


asp 


arg 


pro 


leu 


asp 


ala 


61/21 


















gtc age 


gat 


ctg 


ate 


etc 


cac 


ate 


gac 


gag 


val ser 


asp 


leu 


ile 


leu 


his 


ile 


asp 


glu 


121/41 


















gac tec 


gat 


acc 


ggc 


gtc 


aac 


atg 


ctg 


ttc 


asp ser 


asp 


thr 


gly 


val 


asn 


met 


leu 


phe 


181/61 


















ttg cac 


gcg 


aat 


teg 


cag 


get 


gac 


gcc 


gaa 


leu his 


ala 


asn 


ser 


gin 


ala 


asp 


ala 


glu 


241/81 


















gcc ggc 


gcg 


cgt 


tga 












ala gly 


ala 


arg 


OPA 













e et al, 1998 (Nature 393: 537-544) and 



31/11 

teg gcc ttg egg gac tgg gea cac gee gtc 
ser ala leu arg asp trp ala his ala val 
91/31 

ate aac egg etc aat gtg ttc ccg gtc get 
ile asn arg leu asn val phe pro val ala 
151/51 

acc atg cgt gcc gcg gtc gta gaa get gat 
thr met arg ala ala val val glu ala asp 
211/71 

gac gtg gcg egg gtt gcg gee get etc gcg 
asp val ala arg val ala ala ala leu ala 



SEQ ID No, 4 ID 



FIGURE 4 ID 



REPLACEMENT SHEET (RULE 26) 



132/185 



ORF according to Cole et al, 1998 {Nature 393: 537-544) and containing Rv2975c 
1/1 31/11 

tag get egg ggc gcg ggc tgg cgc gga gga ggt gtg ttg egg agg agg tgt gtt gta gtg 
AMB ala arg gly ala gly trp arg gly gly gly val leu arg arg arg cys val val val 
61/21 91/31 

ggg acg gcg gat egg ccg ttg gac gee teg gee ttg egg gac tgg gea eac gee gtc gtc 
gly thr ala asp arg pro leu asp ala ser ala leu arg asp trp ala his ala val val 
121/41 151/51 

age gat ctg ate etc eac ate gac gag ate aac egg etc aat gtg ttc eeg gtc get gac 
ser asp leu ile leu his ile asp glu ile asn arg leu asn val phe pro val ala asp 
181/61 211/71 , , 

tec gat ace ggc gtc aac atg ctg ttc acc atg cgt gee gcg gtc gta gaa get gat ttg 
ser asp thr gly val asn met leu phe thr met arg ala ala val val glu ala asp leu 
241/81 271/91 

eac gcg aat teg eag get gac gee gaa gac gtg gcg egg gtt gcg gee get etc gcg gee 
his ala asn ser gin ala asp ala glu asp val ala arg val ala ala ala leu ala ala 
301/101 

ggc gcg cgt tga 
gly ala arg OPA 

SEQ ID No. 4 IF 
FIGURE 41F 



sequence Rv 2974C predicted by Cole et al . 
in the same reading frame as Seq41D* The 
one case out of three, a deletion of two 
in 



(Nature 393:537-544) and which may be 
sequencing of this region reveals, in 
nucleotides putting in phase observed 



1/1 

ttg aac gga get cgc ggc 
leu asn gly ala arg gly 
61/21 

gag gtg acc gcg act gcg 
glu val thr ala thr ala 
121/41 

gcc etc ggg gee gcg ttg 
ala leu gly ala ala leu 
181/61 

gag gtg ccg gga act ate 
glu val pro gly thr ile 
241/81 

gcg eac gag ggg ttg gee 
ala his glu gly leu ala 
301/101 

gaa aag ace ecc gaa cag 
glu lys thr pro glu gin 











31/11 




aac 


tec 


ggc 


gtg 


ate ctg 


tec 


asn 


ser 


gly 


val 


ile leu 


ser 










91/31 




gcc 


gcc 


gcc 


tct 


ggc gcg 


gta 


ala 


ala 


ala 


ser 


gly ala 


val 










151/51 




tgg 


cgc 


ggc 


gtc 


gag ttg 


gtc 


trp 


arg 


gly 


val 


glu leu 


val 










211/71 




gtc 


teg 


gtg 


ctg 


egg gcc 


gcc 


val 


ser 


val 


leu 


arg ala 


ala 










271/91 




ggt 


gcg 


gtc 


acc 


gcc gcc 


ggt 


gly 


ala 


val 


thr 


ala ala 


gly 








331/111 




ctt 


gac 


gtg 


etc 


gcc gat 


gcg 


leu 


asp 


val 


leu 


ala asp 


ala 



SEQ ID No. 4 IS 
FIGURE 41S 



cag ate ctg cgc ggg ate gca 
gin ile leu arg gly ile ala 

ttg egg gcg gtc gac gcc aac 
leu arg ala val asp ala asn 

gtc gcg teg atg ggt ggc gtg 
val ala ser met gly gly val 

gcc gga gcc gtc gac cag tgc 
ala gly ala val asp gin cys 

gac gcg gcg gtc ate gcg ctg 
asp ala ala val ile ala leu 

ggc gcg gtg gac gcc ggc gga 
gly ala val asp ala gly gly 



REPLACEMENT SHEET (RULE 26) 



133/185 



361/121 391/131 

egg ggc ctg ctg gtt ctg ctg gac gcg ttg cgc tec acc ate tgc ggg cag gca cct gee 
arg gly leu leu val leu leu asp ala leu arg ser thr lie cys gly gin ala pro ala 
421/141 451/151 

egg gcg gtc tae gaa ecc teg ccg cgc gcg ttg ccg acc gac aeg get acc caa cgc ecc 
arg ala val tyr glu pro ser pro arg ala leu pro thr asp thr ala thr gin arg pro 
481/161 511/171 

gcc ccg caa ttc gag gtg atg tat ctg ttg gcg gta tgt gat get gca gcg gcg gac cag 
ala pro gin phe glu val met tyr leu leu ala val cys asp ala ala ala ala asp gin 
541/181 571/191 

ttg egg gat cga etc aag gaa ttg ggt gag teg gtg gcc ate gee get get ccg ecc gac 
leu arg asp arg leu lys glu leu gly glu ser val ala lie ala ala ala pro pro asp 
601/201 631/211 

age tac tec gta cac gtc cac acc gac gac gcc ggt gcc gcc gtg gaa gcc gga ttg gcg 
ser tyr ser val his val his thr asp asp ala gly ala ala val glu ala gly leu ala 
661/221 691/231 

gtg ggg cga gtt age egg ate gtg ate teg gcg etc ggt tec ggg acc age gga ttg ccg 
val gly arg val ser arg ile val ile ser ala leu gly ser gly thr ser gly leu pro 
721/241 751/251 

gcc ggt ggc tgg aeg egg ggc cgc gee gtg ctg gcg gtc gtc gac ggc gac ggt gcc gcc 
ala gly gly trp thr arg gly arg ala val leu ala val val asp gly asp gly ala ala 
781/261 811/271 

gag ctg ttc gcc ggg gag ggc gcc tgc gtg ctg cga ccg ggt cca gac gcc gtg aca ccg 
glu leu phe ala gly glu gly ala cys val leu arg pro gly pro asp ala val thr pro 
841/281 871/291 

gee gcc gat ate agt gcc cac cag ctg gtg egg gcc gtg gta gac acc ggc gcc gcg cac 
ala ala asp ile ser ala his gin leu val arg ala val val asp thr gly ala ala his 
901/301 931/311 

gtg atg gtg ctg ecc aat ggc tat gtg gee gcc gaa gaa ctg gtg gcc ggg tgt acc gcg 
val met val leu pro asn gly tyr val ala ala glu glu leu val ala gly cys thr ala 
961/321 991/331 

gcg ate ggc tgg ggc gtc gac gtg gta ecc gtg ccg acc gga teg atg gtg cag ggg ttg 
ala ile gly trp gly val asp val val pro val pro thr gly ser met val gin gly leu 
1021/341 1051/351 

gcc gcg ctg gcc gtg cat gac gcg gcc cgc cag gcc gtc gac gac ggc tac age atg gcc 
ala ala leu ala val his asp ala ala arg gin ala val asp asp gly tyr ser met ala 
1081/361 1111/371 

cgt gcc gcc ggt get tec egg cac gga teg gtg cgc att gcc acc caa aag gcg ctg acc 
arq ala ala gly ala ser arg his gly ser val arg ile ala thr gin lys ala leu thr 
1141/381 1171/391 

tgg gcc ggt acc tgc aag ccg ggc gac ggt ctg ggt ate gcg ggc gac gag gtg ctg ate 
trp ala gly thr cys lys pro gly asp gly leu gly ile ala gly asp glu val leu lie 
1201/401 1231/411 

gtc gee gac gat gtc gcc gcg gcg gcc ate ggt ctg gtc gac ctg ttg ttg gca teg gga 
val ala asp asp val ala ala ala ala ile gly leu val asp leu leu leu ala ser gly 
1261/421 1291/431 

ggc gat ctg gtg aeg gtg eta att ggc gcc ggc gta acc gaa gac gtg get gtc gtc ctg 
gly asp leu val thr val leu ile gly ala gly val thr glu asp val ala val val leu 
1321/441 1351/451 

gaa egg cat gtg cac gac cac cat cca ggc acc gag ctg gtc tec tac cgc acc gga cac 
glu arg his val his asp his his pro gly thr glu leu val ser tyr arg thr gly his 
1381/461 1411/471 
cgc ggc gac gcg ctg ctg ate ggg gtc gag tag 
arg gly asp ala leu leu ile gly val glu AMB 



SEQ ID No. 4 IS (continued) 
FIGURE 4 IS (continued) 



REPLACEMENT SHEET (TtULE 26) 
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Seq41T comprising seq 41F and seq 41S 
1/1 31/11 

tta ggc teg ggg cgc ggg ctg gcg egg agg agg tgt gtt gcg gag gag gtg tgt tgt agt 
leu gly ser gly arg gly leu ala arg arg arg cys val ala glu glu val cys cys ser 
AMB ala arg gly ala gly trp arg gly gly gly val leu arg arg arg cys val val val 
arg leu gly ala arg ala gly ala glu glu val cys cys gly gly gly val leu AMB trp 
61/21 91/31 

ggg gac ggc gga teg gcc gtt gga cgc etc ggc ctt gcg gga ctg ggc aca cgc cgt cgt 
gly asp gly gly ser ala val gly arg leu gly leu ala gly leu gly thr arg arg arg 
gly thr ala asp arg pro leu asp ala ser ala leu arg asp trp ala his ala val val 
gly arg arg ile gly arg trp thr pro arg pro cys gly thr gly his thr pro ser ser 
121/41 151/51 

cag cga tct gat cct cca cat ega cga gat caa ccg get caa tgt gtt ccc ggt cgc tga 
gin arg ser asp pro pro his arg arg asp gin pro ala gin cys val pro gly arg OPA 
ser asp leu ile leu his ile asp glu ile asn arg leu asn val phe pro val ala asp 
ala ile OPA ser ser thr ser thr arg ser thr gly ser met cys ser arg ser leu thr 
181/61 211/71 

etc cga tac egg cgt caa cat get gtt cae cat gcg tgc cgc ggt cgt aga age tga ttt 
leu arg tyr arg arg gin his ala val his his ala cys arg gly arg arg ser OPA phe 
ser asp thr gly val asn met leu phe thr met arg ala ala val val glu ala asp leu 
pro ile pro ala ser thr cys cys ser pro cys val pro arg ser AMB lys leu ile cys 
241/81 271/91 

gea cgc gaa ttc gea ggc tga cgc cga aga cgt ggc gcg ggt tgc ggc cgc tct cgc ggc 
ala arg glu phe ala gly OPA arg arg arg arg gly ala gly cys gly arg ser arg gly 
his ala asn ser gin ala asp ala glu asp val ala arg val ala ala ala leu ala ala 
thr arg ile arg arg leu thr pro lys thr trp arg gly leu arg pro leu ser arg pro 
301/101 331/111 

egg cgc gcg ttg aac gga get cgc ggc aac tec ggc gtg ate ctg tec cag ate ctg cgc 
arg arg ala leu asn gly ala arg gly asn ser gly val ile leu ser gin ile leu arg 
gly ala arg OPA thr glu leu ala ala thr pro ala OPA ser cys pro arg ser cys ala 
ala arg val glu arg ser ser arg gin leu arg arg asp pro val pro asp pro ala arg 
361/121 391/131 

ggg ate gea gag gtg acc gcg act gcg gcc gcc gcc tct ggc gcg gta ttg egg gcg gtc 
gly ile ala glu val thr ala thr ala ala ala ala ser gly ala val leu arg ala val 
gly ser gin arg OPA pro arg leu arg pro pro pro leu ala arg tyr cys gly arg ser 
asp arg arg gly asp arg asp cys gly arg arg leu trp arg gly ile ala gly gly arg 
421/141 451/151 

gac gcc aac gcc etc ggg gcc gcg ttg tgg cgc ggc gtc gag ttg gtc gtc gcg teg atg 
asp ala asn ala leu gly ala ala leu trp arg gly val glu leu val val ala ser met 
thr pro thr pro ser gly pro arg cys gly ala ala ser ser trp ser ser arg arg trp 
arg gin arg pro arg gly arg val val ala arg arg arg val gly arg arg val asp gly 
481/161 511/171 

ggt ggc gtg gag gtg ccg gga act ate gtc teg gtg ctg egg gcc gcc gcc gga gcc gtc 
gly gly val glu val pro gly thr ile val ser val leu arg ala ala ala gly ala val 
val ala trp arg cys arg glu leu ser ser arg cys cys gly pro pro pro glu pro ser 
trp arg gly gly ala gly asn tyr arg leu gly ala ala gly arg arg arg ser arg arg 
541/181 571/191 

gac cag tgc gcg cac gag ggg ttg gee ggt gcg gtc acc gcc gcc ggt gac gcg gcg gtc 
asp gin cys ala his glu gly leu ala gly ala val thr ala ala gly asp ala ala val 
thr ser ala arg thr arg gly trp pro val arg ser pro pro pro val thr arg arg ser 
pro val arg ala arg gly val gly arg cys gly his arg arg arg OPA arg gly gly his 
601/201 631/211 

ate gcg ctg gaa aag acc ccc gaa cag ctt gac gtg etc gcc gat gcg ggc gcg gtg gac 
ile ala leu glu lys thr pro glu gin leu asp val leu ala asp ala gly ala val asp 
ser arg trp lys arg pro pro asn ser leu thr cys ser pro met arg ala arg trp thr 
arg ala gly lys asp pro arg thr ala OPA arg ala arg arg cys gly arg gly gly arg 
661/221 691/231 

gcc ggc gga egg ggc ctg ctg gtt ctg ctg gac gcg ttg cgc tec acc ate tgc ggg cag 
ala gly gly arg gly leu leu val leu leu asp ala leu arg ser thr ile cys gly gin 
pro ala asp gly ala cys trp phe cys trp thr arg cys ala pro pro ser ala gly arg 
arg arg thr gly pro ala gly ser ala gly arg val ala leu his his leu arg ala gly 

SEQ ID No, 4 IT 
FIGURE 41T 



REPLACEMENT SHEET (RULE 26) 
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721/241 751/251 

gca cct gcc egg gcg gtc tac gaa ccc teg ccg cgc gcg ttg ccg acc gac acg get acc 
ala pro ala arg ala val tyr glu pro ser pro arg ala leu pro thr asp thr ala thr 
his leu pro gly arg ser thr asn pro arg arg ala arg cys arg pro thr arg leu pro 
thr cys pro gly gly leu arg thr leu ala ala arg val ala asp arg his gly tyr pro 
781/261 811/271 

caa cgc ccc gcc ccg caa ttc gag gtg atg tat ctg ttg gcg gta tgt gat get gca gcg 
gin arg pro ala pro gin phe glu val met tyr leu leu ala val cys asp ala ala ala 
asn ala pro pro arg asn ser arg OPA cys ile cys trp arg tyr val met leu gin arg 
thr pro arg pro ala ile arg gly asp val ser val gly gly met OPA cys cys ser gly 
841/281 871/291 

gcg gac cag ttg egg gat cga etc aag gaa ttg ggt gag teg gtg gee ate gcc get get 
ala asp gin leu arg asp arg leu lys glu leu gly glu ser val ala ile ala ala ala 
arg thr ser cys gly ile asp ser arg asn trp val ser arg trp pro ser pro leu leu 
gly pro val ala gly ser thr gin gly ile gly OPA val gly gly his arg arg cys ser 
901/301 931/311 

ccg ccc gac age tac tec gta cac gtc cac acc gac gac gee ggt gcc gee gtg gaa gee 
pro pro asp ser tyr ser val his val his thr asp asp ala gly ala ala val glu ala 
arg pro thr ala thr pro tyr thr ser thr pro thr thr pro val pro pro trp lys pro 
ala arg gin leu leu arg thr arg pro his arg arg arg arg cys arg arg gly ser arg 
961/321 991/331 

gga ttg gcg gtg ggg cga gtt age egg ate gtg ate teg gcg etc ggt tec ggg acc age 
gly leu ala val gly arg val ser arg ile val ile ser ala leu gly ser gly thr ser 
asp trp arg trp gly glu leu ala gly ser OPA ser arg arg ser val pro gly pro ala 
ile gly gly gly ala ser AMB pro asp arg asp leu gly ala arg phe arg asp gin arg 
1021/341 1051/351 

gga ttg ccg gcc ggt ggc tgg acg egg ggc cgc gcc gtg ctg gcg gtc gtc gac ggc gac 
gly leu pro ala gly gly trp thr arg gly arg ala val leu ala val val asp gly asp 
asp cys arg pro val ala gly arg gly ala ala pro cys trp arg ser ser thr ala thr 
ile ala gly arg trp leu asp ala gly pro arg arg ala gly gly arg arg arg arg arg 
1081/361 1111/371 

ggt gcc gcc gag ctg ttc gee ggg gag ggc gcc tgc gtg ctg cga ccg ggt cca gac gee 
gly ala ala glu leu phe ala gly glu gly ala cys val leu arg pro gly pro asp ala 
val pro pro ser cys ser pro gly arg ala pro ala cys cys asp arg val gin thr pro 

cys arg arg ala val arg arg gly gly arg leu arg ala ala thr gly ser arg arg arg 
1141/381 1171/391 

gtg aca ccg gee gcc gat ate agt gcc cac cag ctg gtg egg gcc gtg gta gac acc ggc 
val thr pro ala ala asp ile ser ala his gin leu val arg ala val val asp thr gly 
OPA his arg pro pro ile ser val pro thr ser trp cys gly pro trp AMB thr pro ala 
asp thr gly arg arg tyr gin cys pro pro ala gly ala gly arg gly arg his arg arg 
1201/401 1231/411 

gee gcg cac gtg atg gtg ctg ccc aat ggc tat gtg gcc gcc gaa gaa ctg gtg gcc ggg 
ala ala his val met val leu pro asn gly tyr val ala ala glu glu leu val ala gly 
pro arg thr OPA trp cys cys pro met ala met trp pro pro lys asn trp trp pro gly 
arg ala arg asp gly ala ala gin trp leu cys gly arg arg arg thr gly gly arg val 
1261/421 1291/431 

tgt acc gcg gcg ate ggc tgg ggc gtc gac gtg gta ccc gtg ccg acc gga teg atg gtg 
cys thr ala ala ile gly trp gly val asp val val pro val pro thr gly ser met val 
val pro arg arg ser ala gly ala ser thr trp tyr pro cys arg pro asp arg trp cys 

tyr arg gly asp arg leu gly arg arg arg gly thr arg ala asp arg ile asp gly ala 
1321/441 1351/451 

cag ggg ttg gcc gcg ctg gcc gtg cat gac gcg gcc cgc cag gcc gtc gac gac ggc tac 
gin gly leu ala ala leu ala val his asp ala ala arg gin ala val asp asp gly tyr 
arg gly trp pro arg trp pro cys met thr arg pro ala arg pro ser thr thr ala thr 
gly val gly arg ala gly arg ala OPA arg gly pro pro gly arg arg arg arg leu gin 
1381/461 1411/471 

age atg gcc cgt gcc gcc ggt get tec egg cac gga teg gtg cgc att gee ace caa aag 
ser met ala arg ala ala gly ala ser arg his gly ser val arg ile ala thr gin lys 
ala trp pro val pro pro val leu pro gly thr asp arg cys ala leu pro pro lys arg 
his gly pro cys arg arg cys phe pro ala arg ile gly ala his cys his pro lys gly 



SEQ ID No, 4 IT (continued 1) 
FIGURE 4 IT (continued 1) 
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1441/481 1471/491 

gcg ctg acc tgg gcc ggt acc tgc aag ccg ggc gac ggt ctg ggt ate gcg ggc gac gag 
ala leu thr trp ala gly thr cys lys pro gly asp gly Xeu gly ile ala gly asp glu 
arg CPA pro gly pro val pro ala ser arg ala thr val trp val ser arg ala thr arg 
ala asp leu gly arg tyr leu gin ala gly arg arg ser gly tyr arg gly arg arg gly 
1501/501 1531/511 

gtg ctg ate gtc gcc gac gat gtc gcc gcg gcg gcc ate ggt ctg gtc gac ctg ttg ttg 
val leu ile val ala asp asp val ala ala ala ala ile gly leu val asp leu leu leu 
cys OPA ser ser pro thr met ser pro arg arg pro ser val trp ser thr cys cys trp 
ala asp arg arg arg arg cys arg arg gly gly his arg ser gly arg pro val val gly 
1561/521 1591/531 

gca teg gga ggc gat ctg gtg acg gtg eta att ggc gcc ggc gta acc gaa gac gtg get 
ala ser gly gly asp leu val thr val leu ile gly ala gly val thr glu asp val ala 
his arg glu ala ile trp OPA arg cys OCH leu ala pro ala OCH pro lys thr trp leu 
ile gly arg arg ser gly asp gly ala asn trp arg arg arg asn arg arg arg gly cys 
1621/541 1651/551 _ ^ ^ 

gtc gtc ctg gaa egg cat gtg cac gac cac cat cca ggc acc gag ctg gtc tec tac cgc 
val val leu glu arg his val his asp his his pro gly thr glu leu val ser tyr arg 
ser ser trp asn gly met cys thr thr thr ile gin ala pro ser trp ser pro thr ala 
arg pro gly thr ala cys ala arg pro pro ser arg his arg ala gly leu leu pro his 
1681/561 1711/571 
acc gga cac cgc ggc gac gcg ctg ctg ate ggg gtc gag tag 
thr gly his arg gly asp ala leu leu ile gly val glu AMB 
pro asp thr ala ala thr arg cys OPA ser gly ser ser 
arg thr pro arg arg arg ala ala asp arg gly arg val 



SEQ ID No. 4 IT (continued 2) 
FIGURE 4 IT (continued 2) 



1/1 

GCC GGT AAC GCC GCG 
ala gly asn ala ala 
61/21 

GGC GCC CCG GTC GGC 
gly ala pro val gly 
121/41 

TAT CGA GGT GGA AGC 
tyr arg gly gly ser 
181/61 

TGC TCA TCG GAA CCC 
cys ser ser glu pro 
241/81 

GCT CTT GCA GTG ACC 
ala leu ala val thr 
301/101 

CAA TGC AGT AAG TTT 
gin cys ser lys phe 
361/121 

AAC AAA CGT GGC AAT 
asn lys arg gly asn 



TCC CAG TGC TAT CCG 
ser gin cys tyr pro 

CGC GGC CGG GCT CGA 
arg gly arg ala arg 

GGA CGG TGT TGG GAT 
gly arg cys trp asp 

GCG ACG CAC ACA ATG 
ala thr his thr met 

TGA TGA TGA CAC TCA 
OPA OPA OPA his ser 

ACA CAA ACG GAC TTG 
thr gin thr asp leu 

GCC GGG CAG CCT CTG 
ala gly gin pro leu 



31/11 

TCC GCC GGA CCG CCC 
ser ala gly pro pro 
91/31 

CCC GCT CCA CCT GGC 
pro ala pro pro gly 
151/51 

GCA CGC CCA ACT TGC 
ala arg pro thr cys 
211/71 

CCC GCA GCA CCG CAC 
pro ala ala pro his 
271/91 

CCC CCA TAA GGC TCG 
pro pro OCH gly ser 
331/111 

TAA AAA CCT GCG GAG 
OCH lys pro ala glu 
391/131 

CCC TTG TCG GAT C 
pro leu ser asp 



GAA ACA TCA GCG GCG 
glu thr ser ala ala 

CAT CAG CGA CCA GGT 
his gin arg pro gly 

CGG CGA TCG CGG CGA 
arg arg ser arg arg 

GAC GGC GCC CCA CCG 
asp gly ala pro pro 

TCG GCT GCG CCT GAG 
ser ala ala pro glu 

GTG GGG TCT ATG GCC 
val gly ser met ala 



SEQ ID No,42A 
FIGURE 42A 
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Ll 31/11 

CCG GTA ACG CCG CGT CCC AGT GCT ATC CGT CCG CCG GAC CGC CCG AAA CAT CAG CGG CGG 
pro val thr pro arg pro ser ala ile arg pro pro asp arg pro lys his gin arg arg 
61/21 ^1/31 

GCG CCC CGG TCG GCC GCG GCC GGG CTC GAC CCG CTC CAC CTG GCC ATC AGC GAC CAG GTT 
ala pro arg ser ala ala ala gly leu asp pro leu his leu ala ile ser asp gin val 
121/41 151/51 

ATC GAG GTG GAA GCG GAC GGT GTT GGG ATG CAC GCC CAA CTT GCC GGC GAT CGC GGC GAT 
ile glu val glu ala asp gly val gly met his ala gin leu ala gly asp arg gly asp 
181/61 211/71 

GCT CAT CGG AAC CCG CGA CGC ACA CAA TGC CCG CAG CAC CGC ACG ACG GCG CCC CAC CGG 
ala his arg asn pro arg arg thr gin cys pro gin his arg thr thr ala pro his arg 
241/81 271/91 

CTC TTG CAG TGA CCT GAT GAT GAC ACT CAC CCC CAT AAG GCT CGT CGG CTG CGC CTG AGC 
leu leu gin OPA pro asp asp asp thr his pro his lys ala arg arg leu arg leu ser 
301/101 331/111 

AAT GCA GTA AGT TTA CAC AAA CGG ACT TGT AAA AAC CTG CGG AGG TGG GGT CTA TGG CCA 

asn ala val ser leu his lys arg thr cys lys asn leu arg arg trp gly leu trp pro 

361/121 391/131 

ACA AAC GTG GCA ATG CCG GGC AGC CTC TGC CCT TGT CGG ATC 

thr asn val ala met pro gly ser leu cys pro cys arg ile 



SEQ ID No,42B 
FIGURE 42B 



1/1 

CGG TAA CGC CGC GTC 
arg OCH arg arg val 
61/21 

CGC CCC GGT CGG CCG 
arg pro gly arg pro 
121/41 

TCG AGG TGG AAG CGG 
ser arg trp lys arg 
181/61 

CTC ATC GGA ACC CGC 
leu ile gly thr arg 
241/81 

TCT TGC AGT GAC CTG 
ser cys ser asp leu 
301/101 

ATG CAG TAA GTT TAC 
met gin OCH val tyr 
361/121 

CAA ACG TGG CAA TGC 
gin thr trp gin cys 



CCA GTG CTA TCC GTC 
pro val leu ser val 

CGG CCG GGC TCG ACC 
arg pro gly ser thr 

ACG GTG TTG GGA TGC 
thr val leu gly cys 

GAC GCA CAC AAT GCC 
asp ala his asn ala 

ATG ATG ACA CTC ACC 
met met thr leu thr 

ACA AAC GGA CTT GTA 
thr asn gly leu val 

CGG GCA GCC TCT GCC 
arg ala ala ser ala 



31/11 

CGC CGG ACC GCC CGA 
arg arg thr ala arg 
91/31 

CGC TCC ACC TGG CCA 
arg ser thr trp pro 
151/51 

ACG CCC AAC TTG CCG 
thr pro asn leu pro 
211/71 

CGC AGC ACC GCA CGA 
arg ser thr ala arg 
271/91 

CCC ATA AGG CTC GTC 
pro ile arg leu val 
331/111 

AAA ACC TGC GGA GGT 
lys thr cys gly gly 
391/131 

CTT GTC GGA TC 
leu val gly 



AAC ATC AGC GGC GGG 
asn ile ser gly gly 

TCA GCG ACC AGG TTA 
ser ala thr arg leu 

GCG ATC GCG GCG ATG 
ala ile ala ala met 

CGG CGC CCC ACC GGC 
arg arg pro thr gly 

GGC TGC GCC TGA GCA 
gly cys ala OPA ala 

GGG GTC TAT GGC CAA 
gly val tyr gly gin 



SEQ ID N0.42C 
FIGURE 42C 
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Coding sequence Rv2622 predicted by Cole et al , , 1998 (Nature 393:537-544) and 
containing seq42A: 

1/1 31/11 

atg gcc aac aaa cgt ggc aat gcc ggg cag cot ctg ccc ttg teg gat cga gac gac gac 
Met ala asn lys arg gly asn ala gly gin pro leu pro leu ser asp arg asp asp asp 
61/21 91/31 

cac atg cag ggg cac tgg ctg ctg gcc egg ctg ggc aag egg gtg ctg cgt ccc ggc ggc 
his met gin gly his trp leu leu ala arg leu gly lys arg val leu arg pro gly gly 
121/41 151/51 

gtc gaa etc ace egg aea ctg ctg gee cgc gee gag gtg acc gac gcc gac gtg etc gag 
val glu leu thr arg thr leu leu ala arg ala glu val thr asp ala asp val leu glu 
181/61 211/71 

ctg gca ccg ggc ctg ggc cgc acc gca gcc gaa ate ttg gcc cge aac eeg egg teg tac 
leu ala pro gly leu gly arg thr ala ala glu ile leu ala arg asn pro arg ser tyr 
241/81 271/91 

gtg ggg gcg gag age gat ccc aac gcg gcc aac ctg gtc cga cac gtt etc gcc ggc cgc 
val gly ala glu ser asp pro asn ala ala asn leu val arg his val leu ala gly arg 
301/101 331/111 

ggc gac gtc egg gtc ace gac gcg gcc gat acc gga tta tec gac gcc age gcc gat gtc 
qly asp val arg val thr asp ala ala asp thr gly leu ser asp ala ser ala asp val 
361/121 391/131 

gtc ate ggc gag gcg atg ctg ace atg caa ggc aac gcg get aaa cac acg ate gtc gee 
val ile gly glu ala met leu thr met gin gly asn ala ala lys his thr ile val ala 
421/141 451/151 

gag gcg gcg egg gtg ctg agg ccg ggt ggc cgc tac gcg att cac gaa eta gcg ctg gtg 
glu ala ala arg val leu arg pro gly gly arg tyr ala ile his glu leu ala leu val 
481/161 511/171 

ccg gac gac gtc gca gag cag gtc cgc ace gac ctg egg cag teg ctg gcc cgc gcg etc 
pro asp asp val ala glu gin val arg thr asp leu arg gin ser leu ala arg ala leu 
541/181 571/191 

aag gtc aat gcg cgt ccg ctg ace gtt gcg gaa tgg teg cac etc tta gcg ggc cat gga 
lys val asn ala arg pro leu thr val ala glu trp ser his leu leu ala gly his gly 
601/201 631/211 

ctg gtc gtc gaa cac gtt gtc ace get tec atg gcg ttg tta caa ccg cga egg gtg ate 
leu val val glu his val val thr ala ser met ala leu leu gin pro arg arg val ile 
661/221 691/231 

get gac gaa ggc etc ctg ggt gcg ctg egg tte gee gga aac ctg etc ate cat cgt gcc 
ala asp glu gly leu leu gly ala leu arg phe ala gly asn leu leu lie hxs arg ala 
721/241 751/251 

gcg cgt egg cga gtc ctg ttg atg cge cac aea ttc cge agg cat cgt gaa cgc ttg aea 

ala arg arg arg val leu leu met arg his thr phe arg arg his arg glu arg leu thr 

781/261 811/271 

gee gtc gee att gtc gcg cac aaa ccg cac gtc gat teg tga 

ala val ala ile val ala his lys pro his val asp ser OPA 



SEQ ID No ,420 
FIGURE 42D 
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ORF according to Cole et al., 1998 (Nature 393:537-544) and containing Rv2622 
1/1 31/11 

taa aaa cct gcg gag gtg ggg tct atg gcc aac aaa cgt ggc aat gcc ggg cag cct ctg 
OCH lys pro ala glu val gly ser met ala asn lys arg gly asn ala gly gin pro leu 
61/21 91/31 

ccc ttg teg gat cga gac gac gac cac atg cag ggg cac tgg ctg ctg gcc egg ctg ggc 
pro leu ser asp arg asp asp asp his met gin gly his trp leu leu ala arg leu gly 
121/41 151/51 

aag egg gtg ctg cgt ccc ggc ggc gtc gaa etc acc egg aca ctg ctg gee egc gee gag 
lys arg val leu arg pro gly gly val glu leu thr arg thr leu leu ala arg ala glu 
181/61 211/71 

gtg acc gac gcc gac gtg etc gag ctg gca ccg ggc ctg ggc egc acc gea gee gaa ate 
val thr asp ala asp val leu glu leu ala pro gly leu gly arg thr ala ala glu ile 
241/81 271/91 

ttg gcc egc aac ccg egg teg tac gtg ggg gcg gag age gat ccc aac gcg gcc aac ctg 
leu ala arg asn pro arg ser tyr val gly ala glu ser asp pro asn ala ala asn leu 
301/101 331/111 

gtc cga cac gtt etc gee ggc egc ggc gac gtc egg gtc acc gac gcg gcc gat acc gga 
val arg his val leu ala gly arg gly asp val arg val thr asp ala ala asp thr gly 
361/121 391/131 

tta tec gac gcc age gcc gat gtc gtc ate ggc gag gcg atg ctg acc atg caa ggc aac 
leu ser asp ala ser ala asp val val ile gly glu ala met leu thr met gin gly asn 
421/141 451/151 

gcg get aaa cac acg ate gtc gcc gag gcg gcg egg gtg ctg agg ccg ggt ggc egc tac 
ala ala lys his thr ile val ala glu ala ala arg val leu arg pro gly gly arg tyr 
481/161 511/171 

gcg att cac gaa eta gcg ctg gtg ccg gac gac gtc gca gag cag gtc egc acc gac ctg 
ala ile his glu leu ala leu val pro asp asp val ala glu gin val arg thr asp leu 
541/181 571/191 

egg cag teg ctg gcc egc gcg etc aag gtc aat gcg cgt ceg ctg acc gtt gcg gaa tgg 
arg gin ser leu ala arg ala leu lys val asn ala arg pro leu thr val ala glu trp 
601/201 631/211 

teg cac etc tta gcg ggc cat gga ctg gtc gtc gaa cac gtt gtc acc get tec atg gcg 
ser his leu leu ala gly his gly leu val val glu his val val thr ala ser met ala 
661/221 691/231 

ttg tta caa ccg cga egg gtg ate get gac gaa ggc etc ctg ggt gcg ctg egg ttc gcc 
leu leu gin pro arg arg val ile ala asp glu gly leu leu gly ala leu arg phe ala 
721/241 751/251 

gga aac ctg etc ate cat cgt gcc gcg cgt egg cga gtc ctg ttg atg egc cac aca ttc 
gly asn leu leu ile his arg ala ala arg arg arg val leu leu met arg his thr phe 
781/261 811/271 

egc agg eat cgt gaa egc ttg aca gcc gtc gcc att gtc gcg cae aaa ccg cac gtc gat 
arg arg his arg glu arg leu thr ala val ala ile val ala his lys pro his val asp 
841/281 
teg tga 
ser OPA 



SEQ ID No,42F 
FIGURE 42F 
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1/1 

ate gcg cgt gac ate gat gac 
lie ala arg asp ile asp asp 
61/21 

gtt gtt tea gcg ggc gac gtg 
val val ser ala gly asp val 
121/41 

aag att aag gtc gcg ggc atg 
lys ile lys val ala gly met 
181/61 

gtt Gtg cac cgc cat ccg cac 
val leu his arg his pro his 
241/81 

ctg acc ggg ttg gcg gcg ttc 
leu thr gly leu ala ala phe 



31/11 



cag 


ggt 


egg 


ctg tgt 


ctg 


gin 


gly 


arg 


leu cys 


leu 








91/31 




gtg 


cat 


ttg 


cgt taa 


etc 


val 


his 


leu 


arg OCH 


leu 








151/51 




age 


tat 


ccg 


gag aat 


gtc 


ser 


tyr 


pro 


glu asn 


val 








211/71 




tgg 


aat 


cgc 


tta ate 


tgg 


trp 


asn 


arg 


leu ile 


trp 








271/91 




ggg 


tec 


gga 


ttc gtc 


aac 


gly 


ser 


gly phe val 


asn 



SEQ ID No,43A 
FIGURE 43A 



gac gtc ggc ggt cga acg gta 
asp val gly gly arg thr val 

gcg egg age tgg cgt cec caa 
ala arg ser trp arg pro gin 

ctg gee get ggc gag cag gtc 
leu ala ala gly glu gin val 

ccc gtc gtg gtg ctg gtc ttg 
pro val val val leu val leu 

teg aca cct tgg cag cag ate 
ser thr pro trp gin gin ile 



1/1 

teg cgc gtg aca teg atg ace 
ser arg val thr ser met thr 
61/21 

ttg ttt cag egg gcg aeg tgg 
leu phe gin arg ala thr trp 
121/41 

aga tta agg teg egg gea tga 
arg leu arg ser arg ala OPA 
181/61 

ttc tge acc gee ate cgc act 
phe cys thr ala ile arg thr 
241/81 

tga ccg ggt tgg egg cgt teg 
OPA pro gly trp arg arg ser 









31/11 




agg 


gtc 


ggc 


tgt gtc 


tgg 


arg 


val 


gly 


cys val 


trp 








91/31 




tge 


att 


tge 


gtt aac 


teg 


cys 


ile 


cys 


val asn 


ser 








151/51 




get 


ate 


egg 


aga atg 


tec 


ala 


ile 


arg 


arg met 


ser 








211/71 




gga 


ate 


get 


taa tct 


ggc 


gly 


ile 


ala 


OCH ser 


gly 








271/91 




ggt 


ccg 


gat 


teg tea 


act 


gly pro 


asp 


ser ser 


thr 



SEQ ID No,43B 
FIGURE 43B 



acg teg gcg gtc gaa egg tag 
thr ser ala val glu arg AMB 

cgc gga get ggc gtc ccc aaa 
arg gly ala gly val pro lys 

tgg ccg ctg gcg age agg teg 
trp pro leu ala ser arg ser 

ccg teg tgg tge tgg tct tge 
pro ser trp cys trp ser cys 

cga cac ctt ggc age aga tc 
arg his leu gly ser arg 
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1 / T 
1/1 












cgc gcg 


uga 




cga 


tga 


cca 


arg ala 




I^X S 


arg 


OPA 


pjTO 


61/^1 














age 


y y \3 


cga 


cgt 


ggt 


cys phe 


ser 


gly 


arg 


arg 


gly 


121/41 












gat taa 


ggt 


cgc 


ggg 


cat 


gag 


asp OCH 


gly 


arg 


gly 


his 


glu 


181/61 












tct gca 


ccg 


oca 


tec 


gca 


ctg 


ser ala 


pro 


pro 


ser 


ala 


leu 


241/81 












gac egg 


gtt 


ggc 


ggc 


gtt 


egg 


asp arg 


val 


gly 


gly val arg 



31/11 



y y y 


teg 


get 


gtg tct 


gga 


gly 


ser 


ala 


val ser 


gly 








91/31 




gca 


ttt 


gcg 


tta act 


cgc 


ala 


phe 


ala 


leu thr 


arg 








151/51 




eta 


tec 


gga 


gaa tgt 


cct 


leu 


ser 


gly glu cys 


pro 








211/71 




gaa 


teg 


ett 


aat ctg 


gee 


glu 


ser 


leu 


asn leu 


ala 






271/91 




gtc 


egg 


att 


cgt caa 


etc 


val 


arg 


lie 


arg gin 


leu 



SEQ ID No*43C 
FIGURE 43C 



cgt egg egg teg aac ggt agt 
arg arg arg ser asn gly ser 

gcg gag ctg gcg tec cca aaa 
ala glu leu ala ser pro lys 

ggc cgc tgg cga gca ggt cgt 
gly arg trp arg ala gly arg 

cgt cgt ggt get ggt ctt get 
arg arg gly ala gly leu ala 

gac acc ttg gca gca gat c 
asp thr leu ala ala asp 



Coding sequence Rv3278e predicted by Cole et al., 1998 (Nature 393:537-544) and 
containing seq43A: 

1/1 31/11 

atg age tat ccg gag aat gtc ctg gcc get ggc gag cag gtc gtt ctg cac cgc cat ccg 
Met ser tyr pro glu asn val leu ala ala gly glu gin val val leu his arg his pro 
61/21 91/31 

cac tgg aat cgc tta ate tgg ccc gtc gtg gtg ctg gtc ttg ctg acc ggg ttg gcg gcg 
his trp asn arg leu ile trp pro val val val leu val leu leu thr gly leu ala ala 
121/41 151/51 

ttc ggg tec gga ttc gtc aac teg aca cct tgg cag cag ate get aag aac gtg att cac 
Phe gly ser gly phe val asn ser thr pro trp gin gin ile ala lys asn val ile his 
181/61 211/71 

qcg gtc ate tgg ggg ate tgg ttg gtg ate gtc ggc tgg etc aeg ctg tgg cca ttc ctg 
ala val ile trp gly ile trp leu val ile val gly trp leu thr leu trp pro phe leu 
241/81 271/91 

age tgg ctg ace aca cat ttc gtg gtg acc aac egg egg gtg atg ttc egg eat ggt gtg 
ser trp leu thr thr his phe val val thr asn arg arg val met phe arg his gly val 
301/101 331/111 

etg acc ege age ggg ate gac ata ccg eta gca egg ate aac age gtg gag ttc egg gac 
leu thr arg ser gly ile asp ile pro leu ala arg ile asn ser val glu phe arg asp 
361/121 391/131 

egg ate ttc gag egg att ttt cgc acc ggg aeg ttg att ate gag tec gcg tea caa gat 
arg ile phe glu arg ile phe arg thr gly thr leu ile ile glu ser ala ser gin asp 
421/141 451/151 

ccq etc gag ttc tac aac att ccg cgc ctg egg gag gtg eat gcg ttg ctg tat cac gag 
pro leu glu phe tyr asn ile pro arg leu arg glu val his ala leu leu tyr his glu 
481/161 511/171 
gtt ttc gac aec etg ggc tec gac gag teg ccc age tga 
val phe asp thr leu gly ser asp glu ser pro ser OPA 



SEQ ID No. 4 3D 
FIGURE 43D 
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ORF according to Cole et al . . 1998 (Nature 393:537-544) and containing Rv3278c 
1/1 31/11 

taa etc gcg egg age tgg cgt ccc eaa aag att aag gtc gcg ggc atg age tat cog gag 
OCH leu ala arg ser trp arg pro gin lys lie lys val ala gly met ser tyr pro glu 
61/21 91/31 

aat gtc ctg gee get ggc gag cag gtc gtt ctg cac cgc cat ccg cac tgg aat cgc tta 
asn val leu ala ala gly glu gin val val leu his arg his pro his trp asn arg leu 
121/41 151/51 

ate tgg ecc gtc gtg gtg ctg gtc ttg ctg acc ggg ttg gcg gcg ttc ggg tec gga ttc 
ile trp pro val val val leu val leu leu thr gly leu ala ala phe gly ser gly phe 
181/61 211/71 

gtc aac teg aca cet tgg cag cag ate get aag aac gtg att cac gcg gtc ate tgg ggg 
val asn ser thr pro trp gin gin ile ala lys asn val ile his ala val ile trp gly 
241/81 271/91 

ate tgg ttg gtg ate gtc ggc tgg etc aeg ctg tgg cea ttc ctg age tgg ctg acc aca 
ile trp leu val ile val gly trp leu thr leu trp pro phe leu ser trp leu thr thr 
301/101 331/111 

cat ttc gtg gtg acc aac egg egg gtg atg ttc egg cat ggt gtg ctg acc cgc age ggg 
his phe val val thr asn arg arg val met phe arg his gly val leu thr arg ser gly 
361/121 391/131 

ate gae ata ccg eta gea egg ate aac age gtg gag ttc egg gae egg ate ttc gag egg 
ile asp ile pro leu ala arg ile asn ser val glu phe, arg asp arg ile phe glu arg 
421/141 451/151 

att ttt cgc acc ggg aeg ttg att ate gag tec gcg tea eaa gat ccg etc gag ttc tac 
ile phe arg thr gly thr leu ile ile glu ser ala ser gin asp pro leu glu phe tyr 
481/161 511/171 

aac att ccg cgc ctg egg gag gtg eat gcg ttg ctg tat cac gag gtt ttc gae acc ctg 
asn ile pro arg leu arg glu val his ala leu leu tyr his glu val phe asp thr leu 
541/181 

ggc tec gae gag teg ccc age tga 
gly ser asp glu ser pro ser OPA 



SEQ ID N0.43F 



FIGURE 43F 



143/185 



1/1 

gcc aag atg gat gtc tac caa 
ala lys met asp val tyr gin 
61/21 

acc acc cat ate ggt teg gcg 
thr thr his ile gly ser ala 
121/41 

ccg atg ggg gtt tac age ctg 
pro met gly val tyr ser leu 
181/61 

ttg ccg tat acc caa gtc gga 
leu pro tyr thr gin val gly 
241/81 

ttt aac tec atg cag gtc tgt 
phe asn ser met gin val cys 
301/101 

aac ctg caa ate ccg cag tac 
asn leu gin ile pro gin tyr 
361/121 

cca ggc aaa ggc tec gcg ttc 
pro gly lys gly ser ala phe 
421/141 
gtg gcg ate 
val ala ile 



31/11 

cgc acc gcc gcc ggc tgg 
arg thr ala ala gly trp 
91/31 

ggc atg gcg ccg gaa gcc 
gly met ala pro glu ala 

151/51 
gac tec get ttt ggc ace 
asp ser ala phe gly thr 

211/71 
ccc aat cac tgg tgg agt 
pro asn his trp trp ser 

271/91 
cag aag tec cag tgc ccg 
gin lys ser gin cys pro 

331/111 
aag cat teg gtc gtg atg 
lys his ser val val met 

391/131 
ttc ttt cac acc acc gac 
phe phe his thr thr asp 



SEQ ID N0.44A 
FIGURE 44A 



cag ccg etc aag acc ggt ate 
gin pro leu lys thr gly ile 

aag age gga tat ccg gcc act 
lys ser gly tyr pro ala thr 

gcg ccg aat ccc ggt ggc ggg 
ala pro asn pro gly gly gly 

ggc gac gac aat age ccc acc 
gly asp asp asn ser pro thr 

ttc age acg gcc gac age gag 
phe ser thr ala asp ser glu 

ggc gtc aac aag gee aag gtc 
gly val asn lys ala lys val 

ggc ggg ccc acc gcg ggt tgt 
gly gly pro thr ala gly cys 



1/1 

cca aga tgg atg tct ace aac 
pro arg trp met ser thr asn 
61/21 

cca ccc ata teg gtt egg egg 
pro pro ile ser val arg arg 
121/41 

cga tgg ggg ttt aca gcc tgg 
arg trp gly phe thr ala trp 

181/61 

tgc egt ata ccc aag teg gac 
cys arg ile pro lys ser asp 
241/81 

tta act cca tgc agg tct gtc 
leu thr pro cys arg ser val 
301/101 

ace tgc aaa tec cgc agt aca 
thr cys lys ser arg ser thr 
361/121 

cag gca aag get ccg egt tct 
gin ala lys ala pro arg ser 
421/141 
tgg cga tc 
trp arg 



31/11 

gca ccg ccg ccg get ggc 
ala pro pro pro ala gly 
91/31 

gca tgg cgc egg aag cca 
ala trp arg arg lys pro 
151/51 

act ccg ctt ttg gca ccg 
thr pro leu leu ala pro 

211/71 
cca ate act ggt gga gtg 
pro ile thr gly gly val 

271/91 
aga agt ccc agt gee egt 
arg ser pro ser ala arg 

331/111 
age att egg teg tga tgg 
ser ile arg ser OPA trp 

391/131 
tct ttc aca cca ccg acg 
ser phe thr pro pro thr 



SEQ ID No,44B 
FIGURE 44B 



age cgc tea aga ccg gta tea 
ser arg ser arg pro val ser 

aga gcg gat ate egg cca etc 
arg ala asp ile arg pro leu 

cgc cga ate ccg gtg gcg ggt 
arg arg ile pro val ala gly 

gcg acg aca ata gcc cca cct 
ala thr thr ile ala pro pro 

tea gca egg ccg aca gcg aga 
ser ala arg pro thr ala arg 

gcg tea aca agg cca agg tec 
ala ser thr arg pro arg ser 

gcg ggc cca ccg egg gtt gtg 
ala gly pro pro arg val val 



144/185 



1/1 

caa gat gga tgt eta 
gin asp gly cys leu 
61/21 

cac cca tat egg ttc 
his pro tyr arg phe 
121/41 

gat ggg ggt tta cag 
asp gly gly leu gin 
181/61 

gcc gta tac cca agt 
ala val tyr pro ser 
241/81 

taa etc cat gca ggt 
OCH leu his ala gly 
301/101 

cct gca aat ccc gca 
pro ala asn pro ala 
361/121 

agg caa agg etc cgc 
arg gin arg leu arg 
421/141 
ggc gat c 
gly asp 



cca acg cac cgc cgc 
pro thr his arg arg 

ggc ggg cat ggc gcc 
gly gly his gly ala 

cct gga etc cgc ttt 
pro gly leu arg phe 

egg ace caa tea ctg 
arg thr gin ser leu 

ctg tea gaa gtc cca 
leu ser glu val pro 

gta caa gca ttc ggt 
val gin ala phe gly 

gtt ctt ctt tea cac 
val leu leu ser his 



31/11 

egg ctg gca gcc get 
arg leu ala ala ala 
91/31 

gga age caa gag egg 
gly ser gin glu arg 
151/51 

tgg cac cgc gcc gaa 
trp his arg ala glu 
211/71 

gtg gag tgg cga cga 
val glu trp arg arg 
271/91 

gtg ccc gtt cag cac 
val pro val gin his 
331/111 

cgt gat ggg egt caa 
arg asp gly arg gin 
391/131 

cac cga egg egg gcc 
his arg arg arg ala 



caa gac egg tat cac 
gin asp arg tyr his 

ata tec ggc cac tec 
ile ser gly his ser 

tec egg tgg egg gtt 
ser arg trp arg val 

caa tag ccc cac ctt 
gin AMB pro his leu 

ggc cga cag cga gaa 
gly arg gin arg glu 

caa ggc caa ggt ccc 
gin gly gin gly pro 

cac cgc ggg ttg tgt 
his arg gly leu cys 



SEQ ID N0.44C 
FIGURE 44C 
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Coding sequence Rv0309 predicted by Cole et al,, 1998 (Nature 393:537-544) and 
containing Seq4 4A: 

1/1 31/11 

atg age cga etc eta get ttg ctg tgc get gcg gta tgc acg ggc tgc gtt get gtg gtt 
Met ser arg leu leu ala leu leu cys ala ala val cys thr gly cys vai ala val val 
61/21 51/31 

etc gcg cea gtg age ctg gee gte gtc aac ccg tgg ttc gcg aac teg gte ggc aat gee 
leu ala pro val ser leu ala val val asn pro trp phe ala asn ser val gly asn ala 
121/41 151/51 

act cag gtg gtt teg gtg gtg gga acc ggc ggt teg acg gcc aag atg gat gtc tac caa 
thr gin val val ser val val gly thr gly gly ser thr ala lys met asp val tyr gin 
181/61 211/71 

cge acc gcc gcc ggc tgg cag ccg etc aag acc ggt ate acc acc cat ate ggt teg gcg 
arg thr ala ala gly trp gin pro leu lys thr gly lie thr thr his ile gly ser ala 
241/81 271/91 

ggc atg gcg ceg gaa gee aag age gga tat ccg gcc act ccg atg ggg gtt tac age ctg 
gly met ala pro glu ala lys ser gly tyr pro ala thr pro met gly val tyr ser leu 
301/101 331/111 

gac tec get ttt ggc acc gcg ccg aat ccc ggt ggc ggg ttg ccg tat acc caa gtc gga 
asp ser ala phe gly thr ala pro asn pro gly gly gly leu pro tyr thr gin val gly 
361/121 391/131 

ccc aat cae tgg tgg agt ggc gac gac aat age eec ace ttt aac tee atg cag gtc tgt 
pro asn his trp trp ser gly asp asp asn ser pro thr phe asn ser met gin val eys 
421/141 451/151 

cag aag tec cag tgc ccg ttc age aeg gcc gac age gag aac ctg caa ate ceg cag tac 
gin lys ser gin cys pro phe ser thr ala asp ser glu asn leu gin ile pro gin tyr 
481/161 511/171 

aag cat teg gte gtg atg ggc gtc aac aag gcc aag gtc cca ggc aaa ggc tec gcg ttc 
lys his ser val val met gly val asn lys ala lys val pro gly lys gly ser ala phe 
541/181 571/191 

ttc ttt cac ace acc gac ggc ggg ccc acc gcg ggt tgt gtg gcg ate gac gat gee aeg 
phe phe his thr thr asp gly gly pro thr ala gly eys val ala ile asp asp ala thr 
601/201 631/211 

ctg gtg cag ate ate cgt tgg ctg egg ect ggt gcg gtg ate gcg ate gee aag taa 
leu val gin ile ile arg trp leu arg pro gly ala val ile ala ile ala lys OCH 



SEQ ID N0.44D 
FIGURE 44D 
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ORF according to Cole et al., 1998 { 



1/1 

tga gcg atg age cga etc eta get ttg ctg 
CPA ala met ser arg leu leu ala leu leu 
61/21 

gtg gtt etc gcg cca gtg age ctg gee gtc 
val val leu ala pro val ser leu ala val 
121/41 

aat gee act cag gtg gtt teg gtg gtg gga 
asn ala thr gin val val ser val val gly 
181/61 

tac caa cgc acc gcc gee ggc tgg cag ecg 
tyr gin arg thr ala ala gly trp gin pro 
241/81 

teg gcg ggc atg gcg ccg gaa gcc aag age 
ser ala gly met ala pro glu ala lys ser 
301/101 

age ctg gac tec get ttt ggc acc gcg ccg 
ser leu asp ser ala phe gly thr ala pro 
361/121 

gtc gga cce aat cac tgg tgg agt ggc gac 
val gly pro asn his trp trp ser gly asp 
421/141 

gtc tgt cag aag tec cag tgc ecg ttc age 
val cys gin lys ser gin cys pro phe ser 
481/161 

cag tac aag cat teg gtc gtg atg ggc gtc 
gin tyr lys his ser val val met gly val 
541/181 

gcg ttc ttc ttt cac acc acc gac ggc ggg 
ala phe phe phe his thr thr asp gly gly 
601/201 

gcc aeg ctg gtg cag ate ate egt tgg ctg 
ala thr leu val gin ile ile arg trp leu 
661/221 
taa 
OCH 



re 393:537-544) and containing Rv0309 



31/11 

tgc get gcg gta tgc aeg ggc tgc gtt get 
cys ala ala val cys thr gly cys val ala 
91/31 

gtc aac ecg tgg ttc gcg aac teg gtc ggc 
val asn pro trp phe ala asn ser val gly 
151/51 

acc ggc ggt teg aeg gcc aag atg gat gtc 
thr gly gly ser thr ala lys met asp val 
211/71 

etc aag acc ggt ate ace acc eat ate ggt 
leu lys thr gly ile thr thr his ile gly 
271/91 

gga tat ccg gcc act ccg atg ggg gtt tac 
gly tyr pro ala thr pro met gly val tyr 
331/111 

aat cce ggt ggc ggg ttg ccg tat acc caa 
asn pro gly gly gly leu pro tyr thr gin 
391/131 

gac aat age cce acc ttt aac tec atg cag 
asp asn ser pro thr phe asn ser met gin 
451/151 

aeg gcc gac age gag aac ctg caa ate ccg 
thr ala asp ser glu asn leu gin ile pro 
511/171 

aac aag gee aag gtc cca ggc aaa ggc tec 
asn lys ala lys val pro gly lys gly ser 
571/191 

cce ace gcg ggt tgt gtg gcg ate gac gat 
pro thr ala gly cys val ala ile asp asp 
631/211 

egg cct ggt gcg gtg ate gcg ate gcc aag 
arg pro gly ala val ile ala ile ala lys 



SEQ ID N0.44F 
FIGURE 44F 
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Cloned fragment fused with phoA 
1/1 

gat etc ccc gga eac cag gtc ate egg cga 
asp leu pro gly his gin val ile arg arg 
61/21 

tec ggt age cag agg cac cag cat cag caa 
ser gly ser gin arg his gin his gin gin 
121/41 

ggt cct tgc cac teg cga tec ttg gga tga 
gly pro cys his ser arg ser leu gly OPA 
181/61 

ate gtg cca gac egg gca tgc cgc gtc ggc 
ile val pro asp arg ala cys arg val gly 
241/81 

gtg cga ccc agg atg gcg aat get egg ggg 
val arg pro arg met ala asn ala arg gly 
301/101 

teg gtg aag ccc aac egg egg tac aac cgc 
ser val lys pro asn arg arg tyr asn arg 
361/121 

ggt gtg gag age agg acg ttg tec teg teg 
gly val glu ser arg thr leu ser ser ser 
421/141 

tec ccg agg cca egg cct tga gcg egg gga 
ser pro arg pro arg pro OPA ala arg gly 
481/161 

etg gtc ate agt egg gcg ate get agg cgc 
leu val ile ser arg ala ile ala arg arg 
541/181 

ace tgc tgt tgc cac cac tgg ccg ggc gcc 
thr cys cys cys his his trp pro gly ala 
601/201 

ggc gcg ttg etc agt teg gcg gee gac ggc 
gly ala leu leu ser ser ala ala asp gly 
661/221 

teg get gee gtt ace teg aeg gcc gcg ace 
ser ala ala val thr ser thr ala ala thr 
721/241 

age cac att ggg gcg cgc aaa gtc teg gtg 
ser his ile gly ala arg lys val ser val 
781/261 

ace gtc agg gca tea ccg agg egg cgc tec 
thr val arg ala ser pro arg arg arg ser 
841/281 

ate gee aac gcg egg tgt cct cct eat gtg 
ile ala asn ala arg cys pro pro his val 
901/301 

egg aca age cga tga ggc cgc ccg cgc tgg 
arg thr ser arg OPA gly arg pro arg trp 



31/11 

gat ggt gat cga ggc teg gac ccg cag gca 
asp gly asp arg gly ser asp pro gin ala 
91/31 

cat cgc gat ggc cag cat gee gcg ccg teg 
his arg asp gly gin his ala ala pro ser 
151/51 

egg tgg ggc ata get age gcg cac cag gtc 
arg trp gly ile ala ser ala his gin val 
211/71 

aag ctg teg ggc gcg ggt tag age ggt age 
lys leu ser gly ala gly AMB ser gly ser 
271/91 

tea ccg gcg aag tgg tag ccg egg atg atg 
ser pro ala lys trp AMB pro arg met met 
331/111 

cac gcc cga ttg tec tea ccg ttg gtc tec 
his ala arg leu ser ser pro leu val ser 
391/131 

cga ccg get age agt egg egg gcc aac gcc 
arg pro ala ser ser arg arg ala asn ala 
451/151 

agg atg tgc aat tea gtc aac teg aag tag 
arg met cys asn ser val asn ser lys AMB 
511/171 

gga aag ccg ctg cgt tgc aag ccc agt acc 
gly lys pro leu arg cys lys pro ser thr 
571/191 

ccg gga tag eeg tac gee act ccg age att 
pro gly AMB pro tyr ala thr pro ser ile 
631/211 

age gee gtg gtg teg gcg gcc teg gee tgt 
ser ala val val ser ala ala ser ala cys 
691/231 

gcc tgc cag ccg cgc cgc egg atg tgc tee 
ala cys gin pro arg arg arg met cys ser 
751/251 

ccc ctg ggg tag cgc ate gcg teg aca tac 
pro leu gly AMB arg ile ala ser thr tyr 
811/271 

ata teg etg ggc ggc aga teg atg agg aat 
ile ser leu gly gly arg ser met arg asn 
871/291 

atg aac cga tgc gtg ctt gcg cac cag tat 
met asn arg cys val leu ala his gin tyr 
931/311 

acg ggg ctt gta gcg tat ggc cgt ttc cgc 
thr gly leu val ala tyr gly arg phe arg 



SEQ ID No,45ZA 
FIGURE ASZK 
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961/321 

tea get cgt cgc tgc 
ser ala arg arg cys 
1021/341 

gat tga cct cgt ate 
asp OPA pro arg ile 
1081/361 

att acg tga cag tct 
ile thr OPA gin ser 
1141/381 

atg ctt gac cag ate 
met leu asp gin ile 
1201/401 

cgt ggc ggg ggc ttc 
arg gly gly gly phe 
1261/421 

ate ate ggt ctg ggg 
ile ile gly leu gly 
1321/441 

ttc ecg ata etc age 
phe pro ile leu ser 
1381/461 

ace ggt cct egg ttg 
thr gly pro arg leu 
1441/481 

cag cgt cgt acc aag 
gin arg arg thr lys 



ggc gee gcc ggg ata 
gly ala ala gly ile 

ate tga gtt agt tgc 
ile OPA val ser cys 

gtc ggc aag gag gga 
val gly lys glu gly 

gag age get etc tac 
glu ser ala leu tyr 

cgc gea ccg acc gcg 
arg ala pro thr ala 

atg ttg gtt tec ggc 
met leu val ser gly 

gtt ttc ggt ttt gtc 
val phe gly phe val 

tec ggc agg atg gat 
ser gly arg met asp 

ggg gcc ggg ggc tea 
gly ala gly gly ser 



991/331 

gaa teg cce gcg aac 
glu ser pro ala asn 
1051/351 

ccg cgc aat ggg cat 
pro arg asn gly his 
1111/371 

cgc atg cca ctb tec 
arg met pro leu ser 
1171/391 

gee gaa gat cce aag 
ala glu asp pro lys 
1231/411 

egg egg cgc ctg cag 
arg arg arg leu gin 
1291/431 

gtg gcg ttc aaa gag 
val ala phe lys glu 
1351/451 

gtg atg ttc ggt ggt 
val met phe gly gly 
1411/471 

cgt ggc gga teg get 
arg gly gly ser ala 
1471/491 

ttc acc age cgt atg 
phe thr ser arg met 



cag tgg tac ggc gca 
gin trp tyr gly ala 

ccg cgt gtt ate ggt 
pro arg val ile gly 

gat cat gag cag egg 
asp his glu gin arg 

ttc gca teg agt gtc 
phe ala ser ser val 

ggc gcg gcg ttg ttc 
gly ala ala leu phe 

acc atg ate gga agt 
thr met ile gly ser 

gtg gtg tat gee ate 
val val tyr ala ile 

get ggg get teg cgc 
ala gly ala ser arg 

gaa gat c 
glu asp 
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fragment seq45ZA shifted minus 1 for the reading frame 
1/1 31/11 

ate tec ccg gac acc agg tea tec ggc gag atg gtg ate gag get egg aec cgc agg cat 
lie ser pro asp thr arg ser ser gly glu met val lie glu ala arg thr arg arg his 
61/21 91/31 

ccg gta gee aga ggc ace age ate age aae ate geg atg gee age atg ccg cge cgt egg 
pro val ala arg gly thr ser lie ser asn lie ala met ala ser met pro arg arg arg 
121/41 151/51 

gte ctt gcc act cgc gat cct tgg gat gac ggt ggg gca tag eta gcg cgc acc agg tea 
val leu ala thr arg asp pro trp asp asp gly gly ala AMB leu ala arg thr arg ser 
181/61 211/71 

teg tge cag acc ggg cat gcc gcg teg gca age tgt egg gcg egg gtt aga gcg gta gcg 
ser cys gin thr gly his ala ala ser ala ser cys arg ala arg val arg ala val ala 
241/81 271/91 

tgc gac cca gga tgg cga atg etc ggg ggt cac egg cga agt ggt age cgc gga tga tgt 
cys asp pro gly trp arg met leu gly gly his arg arg ser gly ser arg gly OPA cys 
301/101 331/111 

egg tga age cca acc ggc ggt aea aec gcc acg ccc gat tgt cct cac cgt tgg tct ccg 
arg OPA ser pro thr gly gly thr thr ala thr pro asp cys pro his arg trp ser pro 
361/121 391/131 

gtg tgg aga gca gga cgt tgt ect cgt cge gac egg eta gea gtc ggc ggg cca acg ect 
val trp arg ala gly arg cys pro arg arg asp arg leu ala val gly gly pro thr pro 
421/141 451/151 

ccc cga ggc cac ggc ctt gag cgc ggg gaa gga tgt gca att cag tea act cga agt age 
pro arg gly his gly leu glu arg gly glu gly cys ala ile gin ser thr arg ser ser 
481/161 511/171 

tgg tea tea gtc ggg cga teg eta ggc gcg gaa age cge tge gtt gca age cca gta cca 
trp ser ser val gly arg ser leu gly ala glu ser arg cys val ala ser pro val pro 
541/181 571/191 

cct get gtt gcc acc act ggc egg gcg ccc egg gat age cgt acg cca etc cga gca ttg 
pro ala val ala thr thr gly arg ala pro arg asp ser arg thr pro leu arg ala leu 
601/201 631/211 

geg cgt tgc tea gtt egg egg ccg acg gca gcg ccg tgg tgt egg egg cct egg cct gtt 
ala arg cys ser val arg arg pro thr ala ala pro trp cys arg arg pro arg pro val 
661/221 691/231 

egg ctg ccg tta cct cga egg ccg cga ccg cct gcc age cgc gee gee gga tgt get cca 
arg leu pro leu pro arg arg pro arg pro pro ala ser arg ala ala gly cys ala pro 
721/241 751/251 

gcc aea ttg ggg cge gca aag tct egg tge ccc tgg ggt age gca teg cgt cga cat aca 
ala thr leu gly arg ala lys ser arg cys pro trp gly ser ala ser arg arg his thr 
781/261 811/271 

ccg tea ggg eat cac cga ggc ggc get cca tat cgc tgg gcg gca gat cga tga gga ata 
pro ser gly his his arg gly gly ala pro tyr arg trp ala ala asp arg OPA gly ile 
841/281 871/291 

teg cca acg cge ggt gtc etc etc atg tga tga acc gat gcg tgc ttg cgc acc agt ate 
ser pro thr arg gly val leu leu met OPA OPA thr asp ala cys leu arg thr ser lie 
901/301 931/311 

gga caa gee gat gag gcc gcc cgc get gga egg ggc ttg tag cgt atg gee gtt tec get 
qlv gin ala asp glu ala ala arg ala gly arg gly leu AMB arg met ala val ser ala 
961/321 991/331 

cag etc gtc get gcg gcg ccg ccg gga tag aat cgc ccg cga acc agt ggt acg gcg cag 
gin leu val ala ala ala pro pro gly AMB asn arg pro arg thr ser gly thr ala gin 
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1021/341 
















1051/351 
















att gac etc 


gta 


tea 


tct 


gag 


tta 


gtt 


gee 


cge gca atg 


ggc 


ate 


cge 


gtg 


tta 


teg 


gta 


ile asp leu 


val 


ser 


ser 


glu 


leu 


val 


ala 


arg ala met 


gly 


ile 


arg 


val 


leu 


ser 


val 


1081/361 
















1111/371 
















tta cgt gac 


agt 


ctg 


teg 


gca 


agg 


agg 


gac 


gca tgc cac 


tct 


ccg 


ate 


atg 


age 


age 


gga 


leu arg asp 


ser 


leu 


ser 


ala 


arg 


arg 


asp 


ala cys his 


ser 


pro 


ile 


met 


ser 


ser 


gly 


1141/381 
















1171/391 
















tgc ttg acc 


aga 


teg 


aga 


gcg 


etc 


tct 


aeg 


ccg aag ate 


cca 


agt 


teg 


eat 


cga 


gtg 


tee 


cys leu thr 


arg 


ser 


arg 


ala 


leu 


ser 


thr 


pro lys ile 


pro 


ser 


ser 


his 


arg 


val 


ser 


1201/401 
















1231/411 
















gtg gcg ggg 


get 


tec 


gcg 


cac 


cga 


ccg 


cge 


ggc ggc gee 


tgc 


agg 


gcg 


egg 


cgt 


tgt 


tea 


val ala gly 


ala 


ser 


ala 


his 


arg 


pro 


arg 


gly gly ala 


cys 


arg 


aia 


arg 


arg 


cys 


ser 


1261/421 
















1291/431 
















tea teg gtc 


tgg 


gga 


tgt 


tgg 


ttt 


ccg 


gcg 


tgg cgt tea 


aag 


aga 


cca 


tga 


teg 


gaa 


gtt 


ser ser val 


trp 


gly 


cys 


trp phe pro 


ala 


trp arg ser 


lys 


arg 


pro 


OPA 


ser 


glu 


val 


1321/441 
















1351/451 
















tec cga tac 


tea 


gcg 


ttt 


teg 


gtt 


ttg 


teg 


tga tgt teg 


gtg 


gtg 


tgg 


tgt 


atg 


cca 


tea 


ser arg tyr 


ser 


ala 


phe 


ser 


val 


leu 


ser 


OPA cys ser 


val 


val 


trp 


cys 


met 


pro 


ser 


1381/461 
















1411/471 
















cog gtc etc 


ggt 


tgt 


ccg 


gca 


gga 


tgg 


ate 


gtg gcg gat 


egg 


ctg 


ctg 


ggg 


ctt 


cge 


gcc 


pro val leu 


gly 


cys 


pro 


ala 


gly trp 


ile 


val ala asp 


arg 


leu 


leu 


gly 


leu 


arg 


ala 


1441/481 
















1471/491 
















age gtc gta 


cca 


agg 


ggg 


ccg 


ggg 


get 


cat 


tea cca gcc 


gta 


tgg 


aag 


ate 








ser val val 


pro 


arg 


gly pro 


gly ala his 


ser pro ala 


val 


trp 


lys 


ile 









SEQ ID No,45ZB {continued) 
FIGURE 45ZB (continued) 



151/185 



fragment seq45ZA shifted minus 2 for the reading frame 



1/1 








V — \— ^v^v— 




aca 


cca 


s€2r pro 


arg 


thr 


pro 


61/21 








egg tag 


cca 


aaa 
y "^y 


aca 
y 


arg AMB 




qIu 


ala 


121/41 








tec ttg 


cca 


etc 


aca 
y ^y 


ser leu 


ore 


leu 


ala 


181/61 








cgt gcc 


aga 


ccg 


aac 
y y ^ 


arg ala 


arg 


pro 




241/81 








gcg acc 


cacr 


aat 
y ^ ^ 


aac 
y y ^ 


ala thr 


gin 


asp 


alv 

y jf 


301/101 








rrrtt 


gcc 


caa 


ccg 


cfl V nlu 
3 J; y 


ala 


gin 


pro 


361/121 








tcrt acra 
^3 yy 


OSLO 

y <-*-y 


cag 


gac 


CVS cflv 


glu 


gin 


asp 


421/141 








ccc gag 


gcc 


a eg 


gcc 


pro glu 


ala 


thr 


ala 


481/161 








Qcrt cat 


caa 


tea 


aac 


gly his 


gin 


ser 


alv 


541/181 








ctg ctg 


tta 


cca 


cca 


leu leu 


leu 


pro 


pro 


601/201 








cnc citt 

Vp-*^ V-^ ^ V- ^ 


get 


cag 


ttc 


arg val 


ala 


gin 


phe 


661/221 








ggc tgc 


cat 


tac 


etc 


gly cys 


arg 


tyr 


leu 


721/241 








cca cat 


tgg 


ggc 


gcg 


pro his 


trp 


gly 


ala 


781/261 








cgt cag 


ggc 


ate 


acc 


arg gin 


gly 


ile 


thr 


841/281 








cgc caa 


cgc 


gcg 


gtg 


arg gin 


arg 


ala 


val 


901/301 








gac aag 


ccg 


atg 


agg 


asp lys 


pro 


met 


arg 



ggt cat ccg gcg aga 
gly his pro ala arg 

cca gca tea gca aca 
pro ala ser ala thr 

ate ctt ggg atg acg 
ile leu gly met thr 

atg ccg cgt egg caa 
met pro arg arg gin 

gaa tgc teg ggg gtc 
glu cys ser gly val 

gcg gta caa ccg cca 
ala val gin pro pro 

gtt gtc etc gtc gcg 
val val leu val ala 

ttg age gcg ggg aag 
leu ser ala gly lys 

gat cgc tag gcg egg 
asp arg AMB ala arg 

ctg gcc ggg cgc ccc 
leu ala gly arg pro 

ggc ggc ega egg cag 
gly gly arg arg gin 

gac ggc cgc gac cgc 
asp gly arg asp arg 

caa agt etc ggt gcc 
gin ser leu gly ala 

gag gcg gcg etc cat 
glu ala ala leu his 

tec tec tea tgt gat 
ser ser ser cys asp 

ccg ccc gcg ctg gac 
pro pro ala leu asp 



31/11 










teg 




etc 


trp OPA 


ser 


arg 


1 eu 


91/31 










egg 






ser arg 


trp 


pro 


ala 


151/51 








ata aaa 
y *'y yyy 


cat 


age 


tag 


val gly 


his 


ser 


AMB 


211/71 








get gtc 


aaa 
yyy 


cgc 


aaa 
yyy 


ala val 


alv 


arg 


alv 


271/91 








ace ggc 


gaa 


ata 
y "-y 


gta 


thr gly 


glu 


val 


val 


331/111 








cgc ccg 


att 


gtc 


etc 


arg pro 


ile 


val 


leu 


391/131 








acc ggc 


tag 


cag 


teg 


thr gly 


AHB 


gin 


ser 


451/151 








y Im^ W ^ y 


caa 


ttc 


agt 


asp val 


gin 


phe 


ser 


511/171 








aaa gee 


get 


aca 
y ^y 


ttg 


lys ala 


ala 


ala 


leu 


571/191 








ggg ata 


gcc 


gta 


cgc 


alv ile 


ala 


val 


a rg 


631/211 








cgc cgt 


oat 
y y ^ 


gtc 


Cine 
yy ^ 


arg arg 


alv 


val 


a1 V 
y-'-y 


691/231 








ctg cca 


gcc 


gcg 


ccg 


leu pro 


ala 


ala 


pro 


751/251 








cct ggg 


gta 


gcg 


cat 


pro gly 


val 


ala 


his 


811/271 








ate get 


ggg 


egg 


cag 


ile ala 


gly 


arg 


gin 


871/291 








gaa ccg 


atg 


cgt 


get 


glu pro 


met 


arg 


ala 


931/311 








ggg get 


tgt 


age 


gta 


gly ala 


cys 


ser 


val 



gga 
gly 


ccc 
pro 


gca 
ala 


ggc 
gly 


ate 
ile 


tgc 
cys 


cgc 
arg 


gcc 
ala 


gtc 
val 


ggg 

gly 


cgc 
arg 


gca 
ala 


cca 
pro 


ggt cat 
gly his 


tta 
leu 


gag 
glu 


egg 
arg 


tag 
AMB 


cgt 
arg 


gcc 
ala 


gcg 
ala 


gat 
asp 


gat 
asp 


gtc 
val 


ace 
thr 


gtt 
val 


ggt 
gly 


etc 
leu 


egg 
arg 


gcg 
ala 


ggc 
gly 


caa 
gin 


cgc 
arg 


etc 
leu 


caa 
gin 


etc 
leu 


gaa 
glu 


gta 
val 


get 
ala 


caa 
gin 


gcc 
ala 


cag 
gin 


tac 
tyr 


cac 
his 


cac 
his 


tec 
ser 


gag 
glu 


cat 
his 


tgg 
trp 


ggc 

gly 


etc 
leu 


ggc 
gly 


ctg ttc 
leu phe 


ccg 
pro 


gat 
^sp 


gtg 

val 


etc 
leu 


cag 
gin 


cgc 
arg 


gtc 
val 


gac 
asp 


ata 
ile 


cac 
his 


ate 
ile 


gat 
asp 


gag 
glu 


gaa 
glu 


tat 
tyr 


tgc 
cys 


gca 
ala 


cca 
pro 


gta 
val 


teg 
ser 


tgg 
trp 


ccg 
pro 


ttt 
phe 


ccg 
pro 


etc 
leu 



SEQ ID No,45ZC 
FIGURE 452;C 



152/185 



961/321 
















991/331 
















age teg teg 


etg 


egg 


egc 


cgc 


egg 


gat 


aga 


ate gee cgc 


gaa 


cca 


gtg 


gta 


egg 


cgc 


aga 


ser ser ser 


leu 


arg 


arg 


arg 


arg 


asp 


arg 


ile ala arg 


glu 


pro 


val 


val 


arg 


arg 


arg 


1021/341 
















1051/351 
















ttg acc teg 


tat 


cat 


etg 


agt 


tag 


ttg 


ccc 


gcg caa tgg 


gca 


tec 


gcg 


tgt 


tat 


egg 


tat 


leu thr ser 


tyr 


his 


leu 


ser 


AMB 


leu 


pro 


ala gin trp 


ala 


ser 


ala 


cys 


tyr 


arg 


tyr 


1081/361 
















1111/371 
















tac gtg aca 


gtc 


tgt 


egg 


caa 


gga 


ggg 


acg 


cat gee act 


etc 


cga 


tea 


tga 


gca 


gcg 


gat 


tyr val thr 


val 


cys 


arg 


gin 


gly 


gly 


thr 


his ala thr 


leu 


arg 


ser 


OPA 


ala 


ala 


asp 


1141/381 
















1171/391 
















get tga cca 


gat 


cga 


gag 


cgc 


tct 


eta 


cgc 


cga aga tec 


caa 


gtt 


cgc 


ate 


gag 


tgt 


ccg 


ala OPA pro 


asp 


arg 


glu 


arg 


ser 


leu 


arg 


arg arg ser 


gin 


val 


arg 


ile 


glu 


cys 


pro 


1201/401 
















1231/411 
















tgg egg ggg 


ctt 


ccg 


cgc 


acc 


gac 


cgc 


gcg 


gcg gcg cct 


gca 


ggg 


cgc 


ggc 


gtt 


gtt 


cat 


trp arg gly 


leu 


pro 


arg 


thr 


asp 


arg 


ala 


ala ala pro 


ala 


gly 


arg 


gly 


val 


val 


his 


1261/421 
















1291/431 
















cat egg tct 


ggg 


gat 


gtt 


ggt 


tte 


egg 


cgt 


ggc gtt caa 


aga 


gac 


cat 


gat 


egg 


aag 


ttt 


his arg ser 


gly 


asp 


val 


gly phe 


arg 


arg 


gly val gin 


arg 


asp 


his 


asp 


arg 


lys 


phe 


1321/441 
















1351/451 
















eee gat act 


cag 


cgt 


ttt 


egg 


ttt 


tgt 


cgt 


gat gtt egg 


tgg 


tgt 


ggt 


gta 


tgc 


cat 


cac 


pro asp thr 


gin 


arg 


phe 


arg 


phe 


cys 


arg 


asp val arg 


trp 


cys 


gly 


val 


cys 


his 


his 


1381/461 
















1411/471 
















egg tec teg 


gtt 


gtc 


egg 


cag 


gat 


gga 


teg 


tgg egg ate 


ggc 


tgc 


tgg 


ggc 


ttc 


gcg 


cca 


arg ser ser 


val 


val 


arg 


gin 


asp 


gly 


ser 


trp arg ile 


gly 


cys 


trp 


gly 


phe 


ala 


pro 


1441/481 
















1471/491 
















gcg teg tac 


caa 


ggg 


ggc 


egg 


ggg 


etc 


att 


cac cag ccg 


tat 


gga 


aga 


tc 








ala ser tyr 


gin 


gly 


gly 


arg gly 


leu 


ile 


his gin pro 


tyr 


gly 


arg 











SEQ ID No,45ZC (continued 1) 
FIGURE 45ZC (continued 1) 



ORF de seq ^ 


i52A 


directement 


en fusion avec phoA 
















cag tct gtc 


ggc 


aag 


gag 


gga 


cgc 


atg 


cca 


etc tec gat 


eat 


gag 


cag 


egg 








gin ser val 


gly 


lys 


glu 


gly 


arg 


met 


pro 


leu ser asp 


his 


glu 


gin 


arg 








1141/381 
















1171/391 
















atg ctt gac 


cag 


ate 


gag 


age 


get 


etc 


tac 


gee gaa gat 


ccc 


aag 


ttc 


gca 


teg 


agt 


gtc 


met leu asp 


gin 


ile 


glu 


ser 


ala 


leu 


tyr 


ala glu asp 


pro 


lys 


phe 


ala 


ser 


ser 


val 


1201/401 
















1231/411 
















cgt ggc ggg 


ggc 


tte 


cgc 


gca 


ccg 


acc 


gcg 


egg egg cgc 


etg 


cag 


ggc 


gcg 


gcg 


ttg 


ttc 


arg gly gly 


gly 


phe 


arg 


ala 


pro 


thr 


ala 


arg arg arg 


leu 


gin 


gly 


ala 


ala 


leu 


phe 


1261/421 
















1291/431 
















ate ate ggt 


etg 


ggg 


atg 


ttg 


gtt 


tec 


ggc 


gtg gcg tte 


aaa 


gag 


acc 


atg 


ate 


gga 


agt 


ile ile gly 


leu 


gly 


met 


leu 


val 


ser 


gly 


val ala phe 


lys 


glu 


thr 


met 


ile 


gly 


ser 


1321/441 
















1351/451 
















tte ccg ata 


etc 


age 


gtt 


tte 


ggt 


ttt 


gtc 


gtg atg tte 


ggt 


ggt 


gtg 


gtg 


tat 


gcc 


ate 


phe pro ile 


leu 


ser 


val 


phe 


gly 


phe 


val 


val met phe 


gly gly 


val 


val 


tyr 


ala 


ile 


1381/461 
















1411/471 
















ace ggt cct 


egg 


ttg 


tec 


ggc 


agg 


atg 


gat 


cgt ggc gga 


teg 


get 


get 


ggg 


get 


teg 


cgc 


thr gly pro 


arg 


leu 


ser 


gly 


arg 


met 


asp 


arg gly gly 


ser 


ala 


ala 


gly 


ala 


ser 


arg 


1441/481 
















1471/491 
















cag cgt cgt 


acc 


aag 


ggg 


gee 


ggg 


ggc 


tea 


ttc acc age 


cgt 


atg 


gaa 


gat 


c 






gin arg arg 


thr 


lys 


gly 


ala 


gly 


gly 


ser 


phe thr ser 


arg 


met 


glu 


asp 









SEQ ID N0.45A 
FIGURE 45A 

^^^^ ^„„„rr^ j^^Tv v:< 



153/185 



Sequence Rv2169c predicted by Cole et al . , 1998 (Nature 393:537-544) and 
containing Seq45A 

1/1 31/11 



atg cca 


cte 


tec 


gat 


cat 


gag 


eag 


egg 


atg 


ctt gae 


cag 


ate 


gag 


age 


get 


etc 


tac 


gcc 


Met pro 


leu 


ser 


asp 


his 


glu 


gin 


arg 


met 


leu asp 


gin 


ile 


glu 


ser 


ala 


leu 


tyr 


ala 


61/21 


















91/31 


















gaa gat 


ccc 


aag 


ttc 


gea 


teg 


agt 


gtc 


cgt 


ggc ggg 


ggc 


ttc 


cgc 


gea 


ccg 


acc 


gcg 


egg 


glu asp 


pro 


lys 


phe 


ala 


ser 


ser 


val 


arg 


gly gly 


gly phe 


arg 


ala 


pro 


thr 


ala 


arg 


121/41 


















151/51 


















egg cgc 


ctg 


cag 


ggc 


gcg 


gcg 


ttg 


ttc 


ate 


ate ggt 


ctg 


ggg 


atg 


ttg 


gtt 


tec 


ggc 


gtg 


arg arg 


leu 


gin 


gly 


ala 


ala 


leu 


phe 


ile 


ile gly 


leu 


gly met leu 


val 


ser 


gly 


val 


181/61 


















211/71 


















gcg ttc 


aaa 


gag 


acc 


atg 


ate 


gga 


agt 


ttc 


ccg ata 


etc 


age 


gtt 


ttc 


ggt 


ttt 


gtc 


gtg 


ala phe 


lys 


glu 


thr 


met 


ile 


gly 


ser 


phe 


pro ile 


leu 


ser 


val 


phe 


gly 


phe 


val 


val 


241/81 


















271/91 


















atg ttc 


ggt 


ggt 


gtg 


gtg 


tat 


gee 


ate 


acc 


ggt cct 


egg 


ttg 


tec 


ggc 


agg 


atg 


gat 


cgt 


met phe 


gly 


gly 


val 


val 


tyr 


ala 


ile 


thr 


gly pro 


arg 


leu 


ser 


gly 


arg 


met 


asp 


arg 


g 301/101 


















331/111 




















teg 


get 


get 


ggg 


get 


teg 


cgc 


cag 


cgt cgt 


ace 


aag 


ggg 


gcc 


ggg 


ggc 


tea 


ttc 


gly gly 


ser 


ala 


ala 


gly 


ala 


ser 


arg 


gin 


arg arg 


thr 


lys 


gly 


ala 


gly 


gly 


ser 


phe 


^ 361/121 


















391/131 


















acc age 


cgt 


atg 


gaa 


gat 


egg 


ttc 


egg 


cgc 


cgc ttc 


gae 


gag 


taa 












thr ser 


arg 


met 


glu 


asp 


arg phe 


arg 


arg 


arg phe 


asp 


glu 


OCH 













SEQ ID N0.45D 
FIGURE 45D 



ORF according to Cole et al., 1998 {Nature 393:537-544) and containing Rv2169c 



1/1 


















31/11 


















tga cag 


tct 


gtc 


ggc 


aag 


gag 


gga 


cgc 


atg 


cca etc 


tec 


gat 


cat 


gag 


cag 


egg 


atg 


ett 


OPA gin 


ser 


val 


gly 


lys 


glu 


gly 


arg 


met 


pro leu 


ser 


asp 


his 


glu 


gin 


arg 


met 


leu 


61/21 


















91/31 


















gae cag 


ate 


gag 


age 


get 


cte 


tac 


gee 


gaa 


gat ccc 


aag 


ttc 


gea 


teg 


agt 


gtc 


cgt 


ggc 


asp gin 


ile 


glu 


ser 


ala 


leu 


tyr 


ala 


glu 


asp pro 


lys 


phe 


ala 


ser 


ser 


val 


arg 


gly 


121/41 


















151/51 


















ggg ggc 


ttc 


cgc 


gea 


ccg 


acc 


gcg 


egg 


egg 


cgc ctg 


eag 


ggc 


gcg 


gcg 


ttg 


ttc 


ate 


ate 


gly gly 


phe 


arg 


ala 


pro 


thr 


ala 


arg 


arg 


arg leu 


gin 


gly 


ala 


ala 


leu 


phe 


ile 


ile 


181/61 


















211/71 


















ggt ctg 


ggg 


atg 


ttg 


gtt 


tec 


ggc 


gtg 


gcg 


ttc aaa 


gag 


ace 


atg 


ate 


gga 


agt 


ttc 


ccg 


gly leu 


gly 


met 


leu 


val 


ser 


gly 


val 


ala 


phe lys 


glu 


thr 


met 


ile 


gly 


ser 


phe 


pro 


241/81 


















271/91 


















ata etc 


age 


gtt 


ttc 


ggt 


ttt 


gtc 


gtg 


atg 


ttc ggt 


ggt 


gtg 


gtg 


tat 


gcc 


ate 


ace 


ggt 


ile leu 


ser 


val 


phe 


gly 


phe 


val 


val 


met 


phe gly 


gly 


val 


val 


tyr 


ala 


ile 


thr 


gly 


301/101 


















331/111 


















cct egg 


ttg 


tec 


ggc 


agg 


atg 


gat 


cgt 


ggc 


gga teg 


get 


get 


ggg 


get 


teg 


cgc 


cag 


cgt 


pro arg 


leu 


ser 


gly 


arg 


met 


asp 


arg 


gly 


gly ser 


ala 


ala 


gly ala 


ser 


arg 


gin 


arg 


361/121 


















391/131 


















cgt ace 


aag 


ggg 


gee 


ggg 


ggc 


tea 


ttc 


ace 


age cgt 


atg 


gaa 


gat 


egg 


ttc 


egg 


cgc 


cgc 


arg thr 


lys 


gly 


ala 


gly 


gly 


ser 


phe 


thr 


ser arg 


met 


glu 


asp 


arg 


phe 


arg 


arg 


arg 



421/141 

ttc gae gag taa 
phe asp glu OCH 



SEQ ID N0.45F 
FIGURE 45F 



154/185 



1/1 

cag ccg cgc cgc ate gac cag 
gin pro arg arg ile asp gin 
61/21 

tec ggt gaa cat cgc acc agg 
ser gly glu his arg thr arg 
121/41 

ccg gcc aac tea cag aca ccc 
pro ala asn ser gin thr pro 
181/61 

cgt cgc ate gcc gtc etc gee 
arg arg ile ala val leu ala 
241/81 

teg ggc teg aag cea age gge 
ser gly ser lys pro ser gly 
301/101 

acc gcg cag acc aag get etc 
thr ala gin thr lys ala leu 



31/11 

ggc etc acg ccc ggt cac 
gly leu thr pro gly his 
91/31 

tta ggc age aat ccc gcg 
leu gly ser asn pro ala 

151/51 
tct acg atg cag ggt atg 
ser thr met gin gly met 

211/71 
gee gtt age ate gee gee 
ala val ser ile ala ala 

271/91 
gga eca ctt ccg gac gcg 
gly pro leu pro asp ala 

331/111 
aag age gcg cac atg gtg 
lys ser ala his met val 



SEQ ID N0.46A 
FIGURE 46A 



ttc tec gcg ttc etc aac aat 
phe ser ala phe leu asn asn 

gac ccg cac ccc act cgc cga 
asp pro his pro thr arg arg 

egg acc ccc aga cgc cac tgc 
arg thr pro arg arg his cys 

act gtc gtt gee ggc tgc teg 
thr val val ala gly cys ser 

aag ccg ctg gtc gag gag gcc 
lys pro leu val glu glu ala 

ctg acg gtc aac ggc aag ate 
leu thr val asn gly lys ile 



1/1 


















31/11 


















age cgc 


gcc 


gca 


teg 


acc 


agg 


gcc 


tea 


cgc 


ccg gtc 


act 


tct 


ccg 


cgt 


tec 


tea 


aca 


att 


ser arg 


ala 


ala 


ser 


thr 


arg 


ala 


ser 


arg 


pro val 


thr 


ser 


pro 


arg 


ser 


ser 


thr 


ile 


61/21 


















91/31 


















ccg gtg 


aac 


ate 


gca 


eca 


ggt 


tag 


gca 


gca 


ate ccg 


egg 


acc 


cgc 


acc 


cea 


etc 


gcc 


gac 


pro val 


asn 


ile 


ala 


pro 


gly AMB 


ala 


ala 


ile pro 


arg 


thr 


arg 


thr 


pro 


leu 


ala 


asp 


121/41 


















151/51 


















egg cea 


act 


cac 


aga 


cac 


cet 


eta 


cga 


tgc 


agg gta 


tgc 


gga 


ccc 


cea 


gac 


gcc 


act 


gee 


arg pro 


thr 


his 


arg 


his 


pro 


leu 


arg 


cys 


arg val 


cys 


gly 


pro 


pro 


asp 


ala 


thr 


ala 


181/61 


















211/71 


















gtc gca 


teg 


ccg 


tec 


teg 


ccg 


ccg 


tta 


gca 


teg ccg 


cea 


ctg 


teg 


ttg 


ccg 


get 


get 


cgt 


val ala 


ser 


pro 


ser 


ser 


pro 


pro 


leu 


ala 


ser pro 


pro 


leu 


ser 


leu 


pro 


ala 


ala 


arg 


241/81 


















271/91 


















egg get 


cga 


age 


caa 


gcg 


gcg 


gac 


cac 


ttc 


egg acg 


cga 


age 


cgc 


tgg 


teg 


agg 


agg 


cea 


arg ala 


arg 


ser 


gin 


ala 


ala 


asp 


his 


phe 


arg thr 


arg 


ser 


arg 


trp 


ser 


arg 


arg 


pro 


301/101 


















331/111 


















ccg cgc 


aga 


eca 


agg 


etc 


tea 


aga 


gcg 


cgc 


aca tgg 


tgc 


tga 


egg 


tea 


acg 


gca 


aga 


tc 


pro arg 


arg 


pro 


arg 


leu 


ser 


arg 


ala 


arg 


thr trp 


cys 


OPA 


arg 


ser 


thr 


ala 


arg 





SEQ ID No,46B 
FIGURE 46B 



155/185 



1/1 

gcc gcg ccg cat cga cca 
ala ala pro his arg pro 
61/21 

egg tga aca teg cac cag 
arg OPA thr ser his gin 
121/41 

ggc caa etc aca gac acc 
gly gin leu thr asp thr 
181/61 

teg cat egc cgt cct cgc 
ser his arg arg pro arg 
241/81 

ggg etc gaa gcc aag egg 
gly leu glu ala lys arg 
301/101 

cgc gca gac caa ggc tet 
arg ala asp gin gly ser 



31/11 

ggg cct cac gcc egg tea ctt 
gly pro his ala arg ser leu 
91/31 

gtt agg cag caa tec cgc gga 
val arg gin gin ser arg gly 
151/51 

etc tac gat gca ggg tat gcg 
leu tyr asp ala gly tyr ala 
211/71 

cgc cgt tag cat cgc cgc cac 
arg arg AMB his arg arg his 
271/91 

egg ace act tee gga egc gaa 
arg thr thr ser gly arg glu 

331/111 
caa gag cgc gca cat ggt get 
gin glu arg ala his gly ala 



etc cgc gtt cct caa caa ttc 
leu arg val pro gin gin phe 

ccc gca ccc cac teg ccg acc 
pro ala pro his ser pro thr 

gac ccc cag acg cca etg ccg 
asp pro gin thr pro leu pro 

tgt cgt tgc egg ctg etc gtc 
cys arg cys arg leu leu val 

gcc get ggt cga gga ggc cac 
ala ala gly arg gly gly his 

gac ggt caa egg caa gat c 
asp gly gin arg gin asp 



SEQ ID N0.46C 
FIGURE 46C 



156/185 



Coding sequence Rvl411c 
containing seq4 6A: 

1/1 

atg egg acc ccc aga cgc 
Met arg thr pro arg arg 
61/21 

gcc act gtc gtt gcc ggc 
ala thr val val ala gly 
121/41 

gcg aag ccg ctg gtc gag 
ala lys pro leu val glu 
181/61 

gtg ctg acg gtc aac ggc 
val leu thr val asn gly 
241/81 

acc acc aac ccc acc gcc 
thr thr asn pro thr ala 
301/101 

gat gcc gac ttc gtg gtg 
asp ala asp phe val val 
361/121 

age gat ttc ggt ccc gcc 
ser asp phe gly pro ala 
421/141 

ggc ctg gcc aac gtg ctg 
gly leu ala asn val leu 
481/161 

aac ggc cag aac acc ate 
asn gly gin asn thr ile 
541/181 

gcg ccg ccg ttc aac gcg 
ala pro pro phe asn ala 
601/201 

gat cat caa ctg gca cag 
asp his gin leu ala gin 
661/221 

ttg teg aaa tgg ggc gag 
leu ser lys trp gly glu 



predicted by Cole et al,, 1 



31/11 



cac 


tgc 


cgt 


cgc 


ate gcc 


gtc 


his 


cys 


arg 


arg 


ile ala 


val 










91/31 




tgc 


teg 


teg 


ggc 


teg aag 


cca 


cys 


ser 


ser 


gly 


ser lys 


pro 










151/51 




gag 


gcc 


acc 


gcg 


cag ace 


aag 


glu 


ala 


thr 


ala 


gin thr 


lys 










211/71 




aag 


ate 


ccg 


gga 


ctg tet 


ctg 


lys 


ile 


pro 


gly 


leu ser 


leu 










271/91 




gcg 


acg 




aac 


gtc aag 


etc 


ala 


thr 


gly 


asn 


val lys 


leu 










331/111 




ttc 


gac 




ate 


ctg tac 


gcc 


phe 


asp 


gly 


ile 


leu tyr 


ala 










391/131 




gcc 


gac 


ate 


tac 


gac ccc 


gcc 


ala 


asp 


ile 


tyr 


asp pro 


ala 










451/151 




gcg 


aat 


ttc 


gcc 


gac gca 


aaa 


ala 


asn 


phe 


ala 


asp ala 


lys 










511/171 




cgc 


ate 


age 


ggg 


aag gta 


teg 


arg 


ile 


ser 


gly lys val 


ser 










571/191 




acg 


cag 


ccg 


gtg 


ccg gcg 


acc 


thr 


gin 


pro 


val 


pro ala 


thr 










631/211 




gcc 


cag 


ttg 


gac 


cgc ggc 


teg 


ala 


gin 


leu 


asp 


arg gly 


ser 










691/231 




aag 


gtc 


cag 


gtc 


acg aag 


ccc 


lys 


val 


gin 


val 


thr lys 


pro 



SEQ ID N0.46D 
FIGURE 4 6D 



8 (Nature 393: 537-544) and 



etc gcc gcc gtt age ate gcc 
leu ala ala val ser ile ala 

age ggc gga cca ctt ccg gac 
ser gly gly pro leu pro asp 

get etc aag age gcg cac atg 
ala leu lys ser ala his met 

aag acg ctg age ggc gat etc 
lys thr leu ser gly asp leu 

acg ctg ggt ggg tet gat ate 
thr leu gly gly ser asp ile 

acc ctg acg ccc aac cag tgg 
thr leu thr pro asn gin trp 

cag gtg ctg aat ccg gat acc 
gin val leu asn pro asp thr 

gee gaa ggg egg gat acc ate 
ala glu gly arg asp thr ile 

gca cag gcg gtg aac cag ata 
ala gin ala val asn gin ile 

gtc tgg att cag gag acc ggc 
val trp ile gin glu thr gly 

ggc aat tec gtc cag atg acc 
gly asn ser val gin met thr 

ccg gtg age tga 
pro val ser OPA 



157/185 



ORF according to Cole et ai., 1998 (Nature 393: 537-544): 
and containing the coding sequence Rvl411c: 



1/1 

tag etc acc cag 
AMB leu thr gin 
61/21 

gtc ggg caa tac 
val gly gin tyr 
121/41 

gcc ggc gtc ggc 
ala gly val gly 
181/61 

etc acg ccc ggt 
leu thr pro gly 
241/81 

ggc age aat ccc 
gly ser asn pro 
301/101 

acg atg cag ggt 
thr met gin gly 
361/121 

gtt age ate gcc 
val ser ile ala 
421/141 

cca ctt ccg gac 
pro leu pro asp 
481/161 

age gcg cac atg 
ser ala his met 
541/181 

age ggc gat etc 
ser gly asp leu 
601/201 

ggg tct gat ate 
gly ser asp ile 
661/221 

ccG aac cag tgg 
pro asn gin trp 
721/241 

aat ccg gat acc 
asn pro asp thr 
781/261 

egg gat acc ate 
arg asp thr ile 
841/281 

gtg aac cag ata 
val asn gin ile 
901/301 

cag gag acc ggc 
gin glu thr gly 
961/321 

gtc cag atg ace 

val gin met thr 

1021/341 

tga 

OPA 



gtt gga ccg gtt 
val gly pro val 

ate gac gac cgt 
ile asp asp arg 

ggt aac eat egg 
gly asn his arg 

cac ttc tec gcg 
his phe ser ala 

gcg gac ccg cac 
ala asp pro his 

atg egg acc ccc 
met arg thr pro 

gcc act gtc gtt 
ala thr val val 

gcg aag ccg ctg 
ala lys pro leu 

gtg ctg acg gtc 
val leu thr val 

acc ace aac ccc 
thr thr asn pro 

gat gcc gac ttc 
asp ala asp phe 

age gat ttc ggt 
ser asp phe gly 

ggc ctg gcc aac 
gly leu ala asn 

aac ggc cag aac 
asn gly gin asn 

gcg ccg ccg ttc 
ala pro pro phe 

gat cat caa ctg 
asp his gin leu 

ttg teg aaa tgg 
leu ser lys trp 



31/11 
cag tgt etc ggc 
gin cys leu gly 

91/31 
cag aca cac gee 
gin thr his ala 

151/51 
acc gcg gat ggt 
thr ala asp gly 

211/71 
ttc etc aac aat 
phe leu asn asn 

271/91 
ccc act cgc ega 
pro thr arg arg 

331/111 
aga cgc cac tgc 
arg arg his cys 

391/131 
gcc ggc tgc teg 
ala gly cys ser 

451/151 
gtc gag gag gee 
val glu glu ala 

511/171 
aac ggc aag ate 
asn gly lys ile 

571/191 
acc gcc gcg acg 
thr ala ala thr 

631/211 
gtg gtg ttc gac 
val val phe asp 

691/231 
ccc gcc gcc gac 
pro ala ala asp 

751/251 
gtg ctg gcg aat 
val leu ala asn 

811/271 
acc ate cgc ate 
thr ile arg ile 

871/291 
aac gcg acg cag 
asn ala thr gin 

931/311 
gea cag gcc cag 
ala gin ala gin 

991/331 
ggc gag aag gtc 
gly glu lys val 



cat 
his 


cac 
his 


gtc 
val 


ggc 
gly 


gtt 
val 


gac 
asp 


age 
ser 


gat 
asp 


cag 
gin 


ccg 
pro 


cgc 
arg 


cgc 
arg 


tec 
ser 


ggt 
gly 


gaa 
glu 


cat 
his 


ccg 
pro 


gcc 
ala 


aac 
asn 


tea 
ser 


cgt 
arg 


cgc 
arg 


ate 
ile 


gcc 
ala 


teg 
ser 


ggc 
gly 


teg 
ser 


aag 
lys 


acc 
thr 


gcg 
ala 


cag 
gin 


acc 
thr 


ccg 
pro 


gga ctg 
gly leu 


tct 
ser 


gga 
gly 


aac 
asn 


gtc 
val 


aag 
lys 


ggg 

gly 


ate 
ile 


ctg 
leu 


tac 
tyr 


ate 
ile 


tac 
tyr 


gac 
asp 


ccc 
pro 


ttc 
phe 


gcc 
ala 


gac 
asp 


gea 
ala 


age 
ser 


ggg aag 
gly lys 


gta 
val 


ccg 
pro 


gtg 
val 


ccg 
pro 


gcg 
ala 


ttg 
leu 


gac 
asp 


cgc 
arg 


ggc 
gly 


cag 
gin 


gtc 
val 


acg 
thr 


aag 
lys 



ggt gaa ttg gcc 
gly glu leu ala 

ega gtc gcc gtg 
arg val ala val 

ate gac cag ggc 
ile asp gin gly 

cgc acc agg tta 
arg thr arg leu 

cag aca ccc tct 
gin thr pro ser 

gtc etc gcc gcc 
val leu ala ala 

cca ago ggc gga 
pro ser gly gly 

aag get etc aag 
lys ala leu lys 

ctg aag acg ctg 
leu lys thr leu 

etc acg ctg ggt 
leu thr leu gly 

gcc acc ctg acg 
ala thr leu thr 

gcc cag gtg ctg 
ala gin val leu 

aaa gcc gaa ggg 
lys ala glu gly 

teg gea cag gcg 
ser ala gin ala 

acc gtc tgg att 
thr val trp ile 

teg ggc aat tec 
ser gly asn ser 

ccc ccg gtg age 
pro pro val ser 



SEQ ID N0.46F 



FIGURE 46F 



158/185 



1/1 

gag ctg gtc aac ggc gcc ggc 
glu leu val asn gly ala gly 
61/21 

ctg gcc gat gcc cag cag atg 
leu ala asp ala gin gin met 
121/41 

gtg ttg gtg gcc teg ggc age 
val leu val ala ser gly ser 
181/61 

ttc gac get gtg atg gac gcg 
phe asp ala val met asp ala 
241/81 

egg gtg ctg etc gag cag ggt 
arg val leu leu glu gin gly 
301/101 

ggg ttg ggc aat gcc gcc ggt 
gly leu gly asn ala ala gly 



1/1 

age tgg tea acg gcg ccg gca 
ser trp ser thr ala pro ala 
61/21 

tgg ccg atg ecc age aga tgg 
trp pro met pro ser arg trp 
121/41 

tgt tgg tgg cct egg gca gca 
cys trp trp pro arg ala ala 
181/61 

teg acg ctg tga tgg acg cga 
ser thr leu OPA trp thr arg 
241/81 

ggg tgc tgc teg age agg gtc 
gly cys cys ser ser arg val 
301/101 

ggt tgg gca atg ccg ccg gtt 
gly trp ala met pro pro val 



31/11 

ate gac gac gcc gcc gtc 
lie asp asp ala ala val 
91/31 

gtc gag gcg gca ctg ggc 
val glu ala ala leu gly 

151/51 
aac cat gtg gcg ccc att 
asn his val ala pro ile 

211/71 
aac gtg egg ggt gcc tgg 
asn val arg gly ala trp 

271/91 
cag ggc ggc age gtg gtg 
gin gly gly ser val val 

331/111 
tac age gcg tac tgc ccg 
tyr ser ala tyr cys pro 



SEQ ID N0.47A 
FIGURE 47A 









31/11 




teg 


acg 


acg 


ccg ccg 


teg 


ser 


thr 


thr 


pro pro 


ser 








91/31 




teg 


agg 


egg 


eac tgg 


gcc 


ser 


arg 


arg 


his trp 


ala 








151/51 




acc 


atg 


tgg 


cgc eca 


tta 


thr 


met 


trp 


arg pro 


leu 








211/71 




acg 


tgc 


ggg 


gtg cct 


ggc 


thr 


cys 


gly 


val pro 


gly 








271/91 




agg 


gcg 


gca 


gcg tgg 


tgc 


arg 


ala 


ala 


ala trp 


cys 








331/111 




aca 


gcg 


cgt 


act gee 


cgt 


thr 


ala 


arg 


thr ala 


arg 



gtg 
val 


acc 
thr 


tgc 
cys 


egg 
arg 


ccg 
pro 


gac 
asp 


age 
ser 


cga 
arg 


tat 
tyr 


ggc 
gly 


cgt 
arg 


ttg 
leu 


gac 
asp 


gga 
gly 


acc 
thr 


gag atg 
glu met 


gcc 
ala 


gtc 
val 


gag 
glu 


gac 
asp 


ctg 
leu 


gtg 

val 


tgt 
cys 


egg 
arg 


gcg 
ala 


gee 
ala 


gga 
gly 


ctg 
leu 


gtg 
val 


teg 
ser 


tec 
ser 


gtt 
val 


cgc ggc 
arg gly 


teg 
ser 


aag 
lys 


gcg 

ala 


ggc 
gly 


acc 
thr 


gat 
asp 


e 



tga 
OPA 


cct 
pro 


gcc 
ala 


ggc 
gly 


egg 
arg 


aca 
thr 


gcc 
ala 


gat 
asp 


atg 
met 


gcc 
ala 


gtt 
val 


tgg 
trp 


acg 
thr 


gag 
glu 


ccg 
pro 


aga 
arg 


tgg ccg 
trp pro 


teg 
ser 


agg 
arg 


act 
thr 


tgg 
trp 


tgt 
cys 


gtc 
val 


ggg 

gly 


egg 
arg 


ccg 
pro 


gac 
asp 


tgg 
trp 


tgt 
cys 


cgt 
arg 


ccg 
pro 


ttc 
phe 


gcg 
ala 


gcg 
ala 


cga 
arg 


agg 
arg 


egg 
arg 


gca 
ala 


ccg 
pro 


ate 
ile 





SEQ ID N0.47B 
FIGURE 47B 



159/185 



1/1 






get ggt 


caa 


egg 


ala gly 


gin 


arg 


61/21 






ggc cga 


tgc 


cca 


gly arg 


cys 


pro 


121/41 






gtt ggt 


ggc 


etc 


val gly 


gly 


leu 


181/61 






cga cgc 


tgt 


gat 


arg arg 


cys 


asp 


241/81 






ggt get 


get 


cga 


gly ala 


ala 


arg 


301/101 






gtt ggg 


caa 


tgc 


val gly 


gin 


cys 



cgc egg cat cga 
arg arg his arg 

gca gat ggt cga 
ala asp gly arg 

ggg cag caa cca 
gly gin gin pro 

gga cgc gaa cgt 
gly arg glu arg 

gca ggg tea ggg 
ala gly ser gly 

cgc egg tta cag 
arg arg leu gin 



31/11 
cga cgc cgc cgt 
arg arg arg arg 

91/31 
ggc ggc act ggg 
gly gly thr gly 

151/51 
tgt ggc gcc cat 
cys gly ala his 

211/71 
gcg ggg tgc ctg 
ala gly cys leu 

271/91 
egg cag cgt ggt 
arg gin arg gly 

331/111 
cgc gta ctg ccc 
arg val leu pro 



cgt gac ctg ecg 
arg asp leu pro 

ecg ata tgg ccg 
pro ile trp pro 

tac cga gat ggc 
tyr arg asp gly 

get ggt gtg teg 
ala gly val ser 

get ggt gtc gtc 
ala gly val val 

gtc gaa ggc ggg 
val glu gly gly 



gcc gga cag cct 
ala gly gin pro 

ttt gga egg agt 
phe gly arg ser 

cgt cga gga ctt 
arg arg gly leu 

ggc ggc egg acg 
gly gly arg thr 

cgt teg egg egg 
arg ser arg arg 

cac cga tc 
his arg 



SEQ ID N0.47C 
FIGURE 47C 
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Coding sequence Rvl714 predicted by Cole et al . , 1998 (Nature 393: 537-544) and 
containing seq 47A: 

1/1 31/11 

gtg gag gaa atg gcg ctg get cag cag gtg ccg aac ctg ggt ctg gcg cgc ttc age gtg 
val glu glu met ala leu ala gin gin val pro asn leu gly leu ala arg phe ser val 
61/21 91/31 

cag gac aag teg ate ctg ate aec ggc gcg ace ggt teg ttg ggc cga gtt gcc gcc egg 
gin asp lys ser ile leu ile thr gly ala thr gly ser leu gly arg val ala ala arg 
121/41 151/51 

gcg ctg gcc gac gcg gga gcg egg ctg aca ctg gcc ggc ggc aac teg gee ggt ctg gcc 
ala leu ala asp ala gly ala arg leu thr leu ala gly gly asn ser ala gly leu ala 
181/61 211/71 

gag ctg gte aac ggc gee ggc ate gac gac gcc gee gte gtg aec tge egg ccg gac age 
qlu leu val asn gly ala gly ile asp asp ala ala val val thr cys arg pro asp ser 
241/81 271/91 

ctg gee gat gcc cag cag atg gte gag geg gea ctg ggc cga tat ggc egt ttg gac gga 
leu ala asp ala gin gin met val glu ala ala leu gly arg tyr gly arg leu asp gly 
301/101 331/111 

gtg ttg gtg gcc teg ggc age aac cat gtg gcg ccc att aec gag atg gcc gte gag gac 
val leu val ala ser gly ser asn his val ala pro ile thr glu met ala val glu asp 
361/121 391/131 

ttc gac get gtg atg gac gcg aac gtg egg ggt gcc tgg ctg gtg tgt egg geg gee gga 
phe asp ala val met asp ala asn val arg gly ala trp leu val cys arg ala ala gly 
421/141 451/151 

egg gtg ctg etc gag cag ggt cag ggc ggc age gtg gtg ctg gtg teg tec gtt cgc ggc 
arg val leu leu glu gin gly gin gly gly ser val val leu val ser ser val arg gly 
481/161 511/171 

qgg ttg ggc aat gee gcc ggt tac age geg tac tge ccg teg aag gcg ggc aec gat ctg 
qly leu gly asn ala ala gly tyr ser ala tyr cys pro ser lys ala gly thr asp leu 
541/181 571/191 

ttg gcc aag aca ttg gcg gcc gaa tgg ggc ggt cae ggc att egg gtg aac gcg ctg gcg 
leu ala lys thr leu ala ala glu trp gly gly his gly ile arg val asn ala leu ala 
601/201 631/211 

ccg aeg gtg ttt egg tec geg gtg ace gag tgg atg ttc ace gac gat ccg aag gge egg 
pro thr val phe arg ser ala val thr glu trp met phe thr asp asp pro lys gly arg 
661/221 691/231 

gcc acc egg gag gcg atg etc gee egg ate ccg ttg cgc cgc ttc gee gaa ccg gaa gae 
ala thr arg glu ala met leu ala arg ile pro leu arg arg phe ala glu pro glu asp 
721/241 751/251 

ttc gte gge gcc ctg ate tat ctg etc age gac gcc teg age ttc tac acc ggc cag gtg 
phe val gly ala leu ile tyr leu leu ser asp ala ser ser phe tyr thr gly gin val 
781/261 811/271 
atg tat ctg gac ggc ggg tac acc gea tge tga 
met tyr leu asp gly gly tyr thr ala cys OPA 



SEQ ID N0.47D 
FIGURE 47D 



161/185 



ORF according to Cole et al., 1998 (Nature 393: 537-544) and containing the 
coding sequence Rvl714: 

24/1 54/11 

tag gtg gag gaa atg gcg ctg got cag cag gtg ccg aac ctg ggt ctg gcg cgc ttc age 
AMB val glu glu met ala leu ala gin gin val pro asn leu gly leu ala arg phe ser 
84/21 114/31 

qtq cag gac aag tog ate ctg ate ace ggc gcg acc ggt teg ttg ggc cga gtt gee gee 
val gin asp lys ser ile leu ile thr gly ala thr gly ser leu gly arg val ala ala 
144/41 174/51 

egg gcg ctg gee gac gcg gga gcg egg ctg aea ctg gee ggc ggc aae teg gee ggt etg 
arg ala leu ala asp ala gly ala arg leu thr leu ala gly gly asn ser ala gly leu 
204/61 234/71 

gee gag ctg gtc aac ggc gcc ggc ate gac gac gee gcc gtc gtg acc tgc egg ccg gac 
ala qlu leu val asn gly ala gly ile asp asp ala ala val val thr cys arg pro asp 
264/81 294/91 

age ctg gcc gat gcc cag cag atg gtc gag gcg gea etg ggc ega tat ggc egt ttg gac 
ser leu ala asp ala gin gin met val glu ala ala leu gly arg tyr gly arg leu asp 
324/101 354/111 

gga gtg ttg gtg gcc teg ggc age aae cat gtg gcg cec att acc gag atg gcc gtc gag 
gly val leu val ala ser gly ser asn his val ala pro ile thr glu met ala val glu 
384/121 414/131 

gac ttc gac get gtg atg gac gcg aac gtg egg ggt gee tgg etg gtg tgt egg gcg gcc 
asp phe asp ala val met asp ala asn val arg gly ala trp leu val cys arg ala ala 
444/141 474/151 

qqa egg gtg ctg etc gag cag ggt cag ggc ggc age gtg gtg ctg gtg teg tec gtt cgc 
gly arg val leu leu glu gin gly gin gly gly ser val val leu val ser ser val arg 
lo4/161 534/171 

qqc ggg ttg ggc aat gee gcc ggt tac age gcg tac tgc ccg teg aag gcg ggc acc gat 
qly gly leu gly asn ala ala gly tyr ser ala tyr cys pro ser lys ala gly thr asp 
§64/181 594/191 

ctg ttg gcc aag aca ttg gcg gcc gaa tgg ggc ggt cac ggc att egg gtg aae gcg ctg 
leu leu ala lys thr leu ala ala glu trp gly gly his gly ile arg val asn ala leu 
624/201 654/211 

gcg ccg aeg gtg ttt egg tec gcg gtg acc gag tgg atg ttc acc gac gat ccg aag ggc 
ala pro thr val phe arg ser ala val thr glu trp met phe thr asp asp pro lys gly 
684/221 714/231 

egg gcc acc egg gag gcg atg etc gcc egg ate ccg ttg cgc cgc ttc gcc gaa ccg gaa 
arg ala thr arg glu ala met leu ala arg ile pro leu arg arg phe ala glu pro glu 
744/241 774/251 

gac ttc gtc ggc gcc ctg ate tat etg etc age gac gcc teg age ttc tac acc ggc cag 
asp phe val gly ala leu ile tyr leu leu ser asp ala ser ser phe tyr thr gly gin 
804/261 834/271 
gtg atg tat ctg gac ggc ggg tac acc gca tgc tga 
val met tyr leu asp gly gly tyr thr ala cys OPA 



SEQ ID N0.47F 
FIGURE 47F 
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1/1 31/11 

agg etc atg age aag acg gtt etc ate ctt ggc geg ggt gte ggc ggc ctg aec ace gee 

arg leu met ser lys thr val leu lie leu gly ala gly val gly gly leu thr thr ala 

61/21 91/31 

gac acc etc cgt caa ctg eta cca ect gag gat e 

asp thr leu arg gin leu leu pro pro glu asp 



SEQ ID No,48A 
FIGURE 48A 



1/1 

ggc tea tga gca aga egg ttc 
gly ser OPA ala arg arg phe 
61/21 

aea cce tec gte aae tgc tac 
thr pro ser val asn cys tyr 



31/11 

tea tec ttg gcg egg gtg 
ser ser leu ala arg val 

91/31 
cac ctg agg ate 
his leu arg lie 



SEQ ID No,48B 
FIGURE 48B 



teg geg gee tga cca eeg eeg 
ser ala ala OPA pro pro pro 



1/1 31/11 

get cat gag caa gae ggt tet cat ect tgg cgc ggg tgt egg egg ect gac cac ege ega 
ala his glu gin asp gly ser his pro trp arg gly cys arg arg pro asp his arg arg 
61/21 

cac ect eeg tea act get ace acc tga gga tc 
his pro pro ser thr ala thr thr OPA gly 



SEQ ID N0.48C 
FIGURE 48C 
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Coding sequence Rv0331 predicted by Cole et al., 1998 (Nature 393: 537-544) and 
containing seq48A: 



1/1 

atg age aag acg gtt 
Met ser lys thr val 
61/21 

etc cgt caa ctg eta 
leu arg gin leu leu 
121/41 

acg ctg ggc ttg teg 
thr leu gly leu ser 
181/61 

gtc cgc ccc acc gcg 
val arg pro thr ala 
241/81 

att gac ate gcg gee 
ile asp ile ala ala 
301/101 

gtg ate gca tta ggt 
val ile ala leu gly 
361/121 

gac gcc gac gtc gcg 
asp ala asp val ala 
421/141 

gtc gag gcg etc gag 
val glu ala leu glu 
481/161 

cca gcc gca ccg ttc 
pro ala ala pro phe 
541/181 

acc gga acc gta cag 
thr gly thr val gin 
601/201 

ccc gag gtc ggc gag 
pro glu val gly glu 
661/221 

cgc aag gcc eta get 
arg lys ala leu ala 
721/241 

tec gaa ccg ttc gat 
ser glu pro phe asp 
781/261 

tea gcg ggt etc age 
ser ala gly leu ser 
841/281 

gcc gac aac gtg tgg 
ala asp asn val trp 
901/301 

ctg ccc aag get gcc 
leu pro lys ala ala 
961/321 

cgc cat etc ggt tac 
arg his leu gly tyr 
1021/341 

gag acc ggt gat cac 
glu thr gly asp his 
1081/361 

teg gtg acg ctg tac 
ser val thr leu tyr 
1141/381 

gcc tgg ctg acc cgc 
ala trp leu thr arg 



etc 
leu 


ate 
ile 


ctt 
leu 


ggc 

gly 


gcg 
ala 


cca 
pro 


cct 
pro 


gag 
glu 


gat 
asp 


cga 
arg 


ttg 
leu 


eta 
leu 


tgg 
trp 


gtg 
val 


ttg 
leu 


gcg teg 
ala ser 


Ctg 
leu 


ccc ggt 
pro gly 


cag gta 
gin val 


gtg 
val 


cac 
his 


acc 
thr 


gcg 
ala 


gcg 
ala 


ctg 
leu 


aac 
asn 


acc 
thr 


ggc cag 
gly gin 


ttc 
phe 


tac 
tyr 


acc 
thr 


cat 
his 


ggc 
gly 


egg 
arg 


ate 
ile 


get 
ala 


gaa gcg 
glu ala 


gcg 
ala 


ttt 
phe 


ctg 
leu 


ate 
ile 


gac 
asp 


acg 
thr 


ttc 
phe 


acg 
thr 


get 
ala 


ttg 
leu 


gtc 
val 


teg 
ser 


atg 
met 


cgc 
arg 


gtc 
val 


gat 
asp 


gag 
glu 


gcc 
ala 


ctg ctt 
leu leu 


gcc 
ala 


gtg 
val 


gtc 
val 


gaa 
glu 


tee 
ser 


ggg 

crl V 


tgg 
trp 


ata 
ile 


gcc 
ala 


ate 
ile 


ggc 
gly 


gat 
asp 


gcg 
ala 


gtg 
val 


ttc 
phe 


gcc 
ala 


gaa gcc 
glu ala 


gac 
asp 


gta 
val 


get 
ala 


gag cgc 
glu arg 


cag 
gin 


gca 
ala 


gee 
ala 


aag 
lys 


ggc 
gly 


ccg 
pro 


ccg 
pro 


teg 
ser 


egg gag 
arg glu 


tgg 
trp 


aag 
lys 


acg 
thr 


tga 
CPA 





31/11 

ggt gtc ggc ggc ctg 
gly val gly gly leu 
91/31 

ate ata ttg gtg gac 
ile ile leu val asp 
151/51 

egg ggc tgg egg egg 
arg gly trp arg arg 
211/71 

gtg gaa atg gtt act 
val glu met val thr 
271/91 

gac aac age gtc ate 
asp asn ser val ile 
331/111 

gac gcc gtt ccc gga 
asp ala val pro gly 
391/131 

ctg gac ggc gcg get 
leu asp gly ala ala 
451/151 

gtg get ate gcc ggg 
val ala ile ala gly 
511/171 

ate gcc gcc caa etc 
ile ala ala gin leu 
571/191 

cct gac ccg ctg ccg 
pro asp pro leu pro 
631/211 

etc aag gat cac ggt 
leu lys asp his gly 
691/231 

gca agg acg atg cac 
ala arg thr met his 
751/251 

ccc ccg cac gtg ccc 
pro pro his val pro 
811/271 

ccc gtg gac ccg cgc 
pro val asp pro arg 
871/291 

acc gtg ctg acg ctg 
thr val leu thr leu 
931/311 

cag gcc gca gtt gtc 
gin ala ala val val 
991/331 

cac ttc ace ggc acg 
his phe thr gly thr 
1051/351 

gac ggc gat ttc ttc 
asp gly asp phe phe 
1111/371 

ttt cac gag gag aag 
phe his glu glu lys 



acc acc gcc gac acc 
thr thr ala asp thr 

agg age ttt gac ggg 
arg ser phe asp gly 

cct gac gac gtc cgc 
pro asp asp val arg 

gca acc gtc gcc cac 
ala thr val ala his 

ggc tat gac gcg ttg 
gly tyr asp ala leu 

ctg teg gac gcg etc 
leu ser asp ala leu 

gag ctg cgt gcg aag 
glu leu arg ala lys 

gtg ccg ttc aaa tgc 
val pro phe lys cys 

ggt gac cgc tac gcc 
gly asp arg tyr ala 

atg ccc gtt gca ggt 
met pro val ala gly 

gtc ggc ttc cat cct 
val gly phe his pro 

ttc ggt gac ggc acg 
phe gly asp gly thr 

tec gcc gcg gcg egg 
ser ala ala ala arg 

acc ctg tec act age 
thr leu ser thr ser 

ccg aat ggc aaa ccg 
pro asn gly lys pro 

gcc cac ggc gtc gcc 
ala his gly val ala 

ggc gcc tgc tac gtc 
gly ala cys tyr val 

get ccg teg gcg ccc 
ala pro ser ala pro 

gtc gca caa gaa ctg 
val ala gin glu leu 



SEQ ID N0.48D 



FIGURE 48D 
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ORF according to Cole et ai , 
sequence Rv0331: 



1998 (Nature 393: 537-544) and containing coding 



1/1 








d Cl "wCX 


ccc 




ccg 


OPA thr 


pro 


ala 


pro 


61/21 










gcc 


gat 


aat 


pro t rp 


ala 


asp 


asn 


121/41 








rrcrr cTcn 


gaa 


tct 


aat 


3 1 a al a 

«U 


glu 


ser 


asn 


181/61 








O. VJ V-* dCl^ 


acg 


gtt 


etc 


ser lys 


thr 


val 


leu 


241/81 










ctg 


eta 


cea 


ajl^g gin 


leu 


leu 


pro 


301/101 








^ ^y y y 


ttg 


teg 


tta 


leu crlv 


leu 


ser 


leu 


361/121 








cgc ccc 


acc 


y '^y 


y 


t* X. ^ ^> w 


thr 


ala 


ala 


421/141 








gac ate 




gcc 


ca0 


asp ile 


ala 


ala 


gin 


481/161 








ate cfca 


tta 


ggt 


oca 
y 


ile ala 


leu 


gly 


ala 


541/181 








gcc gac 


gtc 


gcg 


y y ^ 


ala asp 


val 


ala 


gly 


601/201 








gag gcg 


etc 


gag 


cat 


glu ala 


leu 


glu 


his 


661/221 








gcc gca 


ccg 


ttc 


gaa 


ala ala 


pro 


phe 


glu 


721/241 








gga acc 


gta 


cag 


ate 


gly thr 


val 


gin 


ile 


781/261 








gag gtc 


ggc 


gag 


get 


glu val 


gly glu 


ala 


841/281 








aag gcc 


eta 


get 


cge 


lys ala 


leu 


ala 


arg 



acg egg cga caa teg 
thr arg arg gin ser 

agt ttg act gac teg 
ser leu thr asp ser 

att cca aag ata tat 
ile pro lys ile tyr 

ate ctt ggc gcg ggt 
ile leu gly ala gly 

ect gag gat cga ate 
pro glu asp arg ile 

eta tgg gtg ttg egg 
leu trp val leu arg 

teg ctg ccc ggt gtg 
ser leu pro gly val 

gta gtg cac ace gac 
val val his thr asp 

gcg ctg aac acc gac 
ala leu asn thr asp 

cag ttc tac acc ctg 
gin phe tyr thr leu 

ggc egg ate get gtg 
gly arg ile ala val 

gcg gcg ttt ctg ate 
ala ala phe leu ile 

gac acg ttc acg cct 
asp thr phe thr pro 

ttg gtc teg atg etc 
leu val ser met leu 

gtc gat gag gcc gca 
val asp glu ala ala 



31/11 








egg aaa 


acc 


ggt 


ccg 


arcf Ivs 


thr 


gly pro 


91/31 








y V'V^ ^y ^ 


cac 


ccc 


aag 


val ser 


his 


pro 


Ivs 


151/51 








gga act 


cga 


tge 


gaa 


gly thr 


arg 


cvs 


alu 


211/71 








ate acre 


acre 

y y ^ 


eta 


acc 


val gl^ 


gly 


leu 


thr 


271/91 








ata ttg 


0ta 


gac 


aaa 
**y y 


ile leu 


val 


asp 


arq 


331/111 








QQC tacr 
yy^ ^yy 


^y y 


caa 
^y y 


cct 


crlv trp 


aro 
y 


ara 


pro 


391/131 








gaa atg 


att 


act 


gca 


glu met 


val 


thr 


ala 


451/151 








aac age 


ate 


ate 


QQC 


asn ser 


val 


ile 


alv 


511/171 








acc att 


ccc 


gga 


ctg 


ala val 


pro 


gly 


leu 


571/191 








gac ggc 


gcg 


get 


gag 


asp gly 


ala 


ala 


glu 


631/211 








get ate 


gee 


ggg 


gtg 


ala ile 


ala 


gly 


val 


691/231 








gcc gcc 


caa 


etc 


ggt 


ala ala 


gin 


leu 


gly 


751/251 








gac ccg 


ctg 


ccg 


atg 


asp pro 


leu 


pro 


met 


811/271 








aag gat 


cac 


ggt 


gtc 


lys asp 


his 


gly 


val 


871/291 








agg acg 


atg 


cac 


ttc 


arg thr 


met 


his 


phe 



egg gaa tgc tge ggg 
arg glu cys cys gly 

acc ttg cgc aag act 
thr leu arg lys thr 

gga ate agg etc atg 
gly ile arg leu met 

acc gcc gac acc etc 
thr ala asp thr leu 

age ttt gac ggg acg 
ser phe asp gly thr 

gac gac gtc cgc gtc 
asp asp val arg val 

acc gtc gcc cac att 
thr val ala his ile 

tat gac gcg ttg gtg 
tyr asp ala leu val 

teg gac gcg etc gac 
ser asp ala leu asp 

ctg cgt gcg aag gtc 
leu arg ala lys val 

ccg ttc aaa tgc cea 
pro phe lys cys pro 

gac cgc tac gcc acc 
asp arg tyr ala thr 

ccc gtt gca ggt ccc 
pro val ala gly pro 

ggc ttc cat cct cgc 
gly phe his pro arg 

ggt gac ggc acg tec 
gly asp gly thr ser 



SEQ ID No,48F 
FIGURE 48F 



165/185 



901/301 

gaa ccg ttc gat ctg ctt gcc gtg gtc ccc 
glu pro phe asp leu leu ala val val pro 
961/321 

gcg ggt etc age gaa tec ggg tgg ata ccc 
ala gly leu ser glu ser gly trp ile pro 
1021/341 

gac aac gtg tgg gcc ate ggc gat gcg acc 
asp asn val trp ala ile gly asp ala thr 
1081/361 

ccc aag get gcc gtg ttc gcc gaa gcc cag 
pro lys ala ala val phe ala glu ala gin 
1141/381 

cat etc ggt tac gac gta get gag ege cac 
his leu gly tyr asp val ala glu arg his 
1201/401 

acc ggt gat cac cag gca gcc aag ggc gac 
thr gly asp his gin ala ala lys gly asp 
1261/421 

gtg aeg ctg tac ccg ccg teg egg gag ttt 
val thr leu tyr pro pro ser arg glu phe 
1321/441 

tgg ctg acc ege tgg aag aeg tga 
trp leu thr arg trp lys thr OPA 



931/311 

ccg cac gtg ccc tec gcc gcg gcg egg tea 
pro his val pro ser ala ala ala arg ser 
991/331 

gtg gac ccg ege acc ctg tec act age gcc 
val asp pro arg thr leu ser thr ser ala 
1051/351 

gtg ctg aeg ctg ccg aat ggc aaa ccg ctg 
val leu thr leu pro asn gly lys pro leu 
1111/371 

gcc gca gtt gtc gcc cac ggc gtc gcc ege 
ala ala val val ala his gly val ala arg 
1171/391 

ttc aec ggc aeg ggc gcc tge tac gtc gag 
phe thr gly thr gly ala cys tyr val glu 
1231/411 

ggc gat ttc ttc get ccg teg gcg ccc teg 
gly asp phe phe ala pro ser ala pro ser 
1291/431 

cac gag gag aag gtc gca caa gaa ctg gee 
his glu glu lys val ala gin glu leu ala 



SEQ ID N0.48F (continued) 
FIGURE 48F (continued) 
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Fragment amplified by PGR 
protease of the E.coli htrA 
of the SnaBI site at the 3' 

1/1 

cca tct aca ccg etc aac age 
pro ser thr pro leu asn ser 
61/21 

tga tec gtg gcg agt tgt tea 
OPA ser val ala ser cys ser 
121/41 

tec gtc tga cca acg gta gtt 
ser val OPA pro thr val val 
181/61 

tea tga aca age tgc gtt ggg 
ser OPA thr ser cys val gly 
241/81 

egg tgg ccg ggg gga tgg tea 
arg trp pro gly gly trp ser 
301/101 

egg ccg age ggg tgg cgc gaa 
arg pro ser gly trp arg glu 
361/121 

aat tgg cca gge tga eag agg 
asn trp pro gly OPA gin arg 
421/141 

aac gge agg caa gge tgg tta 
asn gly arg gin gly trp leu 
481/161 

tgc gca cca atg teg aac tct 
cys ala pro met ser asn ser 
541/181 

age agg aga tgg teg ace tgc 
ser arg arg trp ser thr cys 
601/201 

tgg tag gcg att tgg tgg acc 
trp AMB ala ile trp trp thr 
661/221 

teg aca tgg etg acg teg teg 
ser thr trp leu thr ser ser 
721/241 

tec ttt teg aeg teg agg tga 
ser phe ser thr ser arg OPA 
781/261 

gga tgg cgc tta acc tga tgg 
gly trp arg leu thr OPA trp 
841/281 

gtg tea gge tga gee age teg 
val ser gly OPA ala ser ser 



based on the sequence 
family (creation of the 
end) and subeloned into 

31/11 

egg gcc aga cgc tgc egg 
arg ala arg arg cys arg 
91/31 

tgt cgc gge gca cca ccg 
cys arg gly ala pro pro 

151/51 
cgc tgc tga tct cca aaa 
arg cys OPA ser pro lys 

211/71 
tgc tat tga teg tgg gtg 
cys tyr OPA ser trp val 

271/91 
ccc ggg ceg gge tga gge 
pro gly pro gly OPA gly 

331/111 
ccg acg acc tgc gge cca 
pro thr thr cys gly pro 

391/131 
eat tea att taa tgc tgc 
his ser ile OCH cys cys 

451/151 
ccg aeg ccg gae atg aat 
pro thr pro asp met asn 

511/171 
tga tgg cct cga tgg ccc 
OPA trp pro arg trp pro 

571/191 
gtg ccg atg tgc tgg etc 
val pro met cys trp leu 

631/211 
tgt ccc gag gcg acg ccg 
cys pro glu ala thr pro 

691/231 
ace gca gee tgg age ggg 
thr ala ala trp ser gly 

751/251 
ttg ggt gge agg ttt atg 
leu gly gly arg phe met 

811/271 
aca acg ccg cga agt gga 
thr thr pro arg ser gly 

871/291 
aeg cgt cgc acg etg age 
thr arg arg thr leu ser 



similarities with a serine 
BamHI site at the 5* end and 
the vector pJVED: 



teg gtg etg ccg aga agg egg 
ser val leu pro arg arg arg 

ccg acc aac ggg tgc ttg cca 
pro thr asn gly cys leu pro 

gtc tea age cca ccg aag cag 
val ser ser pro pro lys gin 

gga teg ggg tgg egg teg ccg 
gly ser gly trp arg ser pro 

egg tgg gee gcc tea ccg aag 
arg trp ala ala ser pro lys 

tec ccg tct teg gca gcg aeg 
ser pro ser ser ala ala thr 

ggg cgc tgg ccg agt cac ggg 
gly arg trp pro ser his gly 

tgc gta ecc cgc taa cgt cgc 
cys val pro arg OCH arg arg 

egg ggg etc cgc gge tae cca 
arg gly leu arg gly tyr pro 

aaa teg agg aat tgt cca cac 
lys ser arg asn cys pro his 

gag aag tgg tgc acg age egg 
glu lys trp cys thr ser arg 

tea gge gge gge gca acg ata 
ser gly gly gly ala thr ile 

gcg ata ceg etg gat tgt cgc 
ala ile pro leu asp cys arg 

gcc cgc egg gcg gee acg tgg 
ala arg arg ala ala thr trp 

tgg tgg ttt ccg acc gcg gee 
trp trp phe pro thr ala ala 



SEQ ID N0.49A 
FIGURE 4 9A 
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901/301 

egg gca ttc ccg tgc 
arg ala phe pro eys 
961/321 

cac ggg cgt tgc egg 
his gly arg cys arg 
1021/341 

acg gcg gat tgc tgc 
thr ala asp cys cys 
1081/361 

ttt acg tgc tgc tec 
phe thr cys cys ser 
1141/381 

gcg etc gga gca egg 
ala leu gly ala arg 
1201/401 

etc agt cca cgc gcg 
leu ser pro arg ala 
1261/421 

cca cgc atg gcc aag 
pro arg met ala lys 
1321/441 

atg acg aat cac cca 
met thr asn his pro 
1381/461 

cag ggg cag cag caa 
gin gly gin gin gin 
1441/481 

ccg cag cca acc cag 
pro gin pro thr gin 
1501/501 

ata cct ggc gtg att 
ile pro gly val ile 
1561/521 

gca gge atg ttg gcc 
ala gly met leu ala 
1621/541 

gcg gcc gca tec ctg 
ala ala ala ser leu 
1681/561 

get gcc age gcg gcg 
ala ala ser ala ala 
1741/581 

gtg gcg gcc aag gtg 
val ala ala lys val 
1801/601 

gag gag ggc tec ggc 
glu glu gly ser gly 
1861/621 

ate gcg gcg gcc gcc 
ile ala ala ala ala 



agg age gcc gtc tgg 
arg ser ala val trp 

gtt egg gcc teg ggt 
val arg ala ser gly 

gca teg aag aca ccg 
ala ser lys thr pro 

ccg gcc gtc gga tgc 
pro ala val gly cys 

aca teg aga act etc 
thr ser arg thr leu 

caa cct agt tgt gca 
gin pro ser cys ala 

ttg gcc cga gta gtg 
leu ala arg val val 

egg tat teg cca ccg 
arg tyr ser pro pro 

acg tac age cag cag 
thr tyr ser gin gin 

tac cgt caa cec tac 
tyr arg gin pro tyr 

ccg acc atg acg cec 
pro thr met thr pro 

ate ggc gcg gtg acg 
ile gly ala val thr 

gtc ggg ttc aac egg 
val gly phe asn arg 

cca age ate cec gca 
pro ser ile pro ala 

gtg cec agt gtc gtc 
val pro ser val val 

ate att ctg tet gee 
ile ile leu ser ala 

aag cct cec ctg gge 
lys pro pro leu gly 



931/311 

tgt ttg aac ggt ttt 
eys leu asn gly phe 
991/331 

tgg cga teg tea aac 
trp arg ser ser asn 
1051/351 

acc cag gcg gcc age 
thr gin ala ala ser 
1111/371 

cga ttc cgc age ttc 
arg phe arg ser phe 
1171/391 

ggg gtt egg cga acg 
gly val arg arg thr 
1231/411 

gtt act gtt gaa age 
val thr val glu ser 
1291/431 

ggc eta gta cag gaa 
gly leu val gin glu 
1351/451 

ccg cag cag ccg gga 
pro gin gin pro gly 
1411/471 

ttc gac tgg cgt tac 
phe asp trp arg tyr 
1471/491 

gag gcg ttg ggt ggt 
glu ala leu gly gly 
1531/511 

cct cct ggg atg gtt 
pro pro gly met val 
1591/531 

ata gcg gtg gtg tec 
ile ala val val ser 
1651/551 

gca cec gcc ggc cec 
ala pro ala gly pro 
1711/571 

gca aac atg ccg ccg 
ala asn met pro pro 
1771/591 

atg ttg gaa acc gat 
met leu glu thr asp 
1831/611 

gag ggg ctg ate ttg 
glu gly leu ile leu 
1891/631 

agt ccg ccg ccg aaa 
ser pro pro pro lys 



aee ggt egg cat egg 
thr gly arg his arg 

agg tgg tgc tea acc 
arg trp cys ser thr 

cec ctg gaa cgt cga 
pro leu glu arg arg 

ccg gtg cga egg ctg 
pro val arg arg leu 

tta tet cag tgg aat 
leu ser gin trp asn 

cac acc cat gcc agt 
his thr his ala ser 

gag caa cet age gae 
glu gin pro ser asp 

acc cca ggt tat get 
thr pro gly tyr ala 

cca ccg tec ccg cec 
pro pro ser pro pro 

acc egg ccg ggt ctg 
thr arg pro gly leu 

cgc caa cgc cct cgt 
arg gin arg pro arg 

gcc ggc ate ggc ggc 
ala gly ile gly gly 

age ggc gge cca gtg 
ser gly gly pro val 

ggg teg gtc gaa cag 
gly ser val glu gin 

ctg ggc cgc cag teg 
leu gly arg gin ser 

ace aac aae cac gtg 
thr asn asn his val 

acg acg gta 
thr thr val 



SEQ ID No,49A (continued 1) 
FIGURE 4 9A (continued 1) 
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1/1 

cat eta cac cgc tea 
his leu his arg ser 
61/21 

gat cog tgg cga gtt 
asp pro trp arg val 
121/41 

ccg tct gac caa egg 
pro ser asp gin arg 
181/61 

cat gaa caa get gcg 
his glu gin ala ala 
241/81 

ggt ggc egg ggg gat 

gly gly giy ^^p 

301/101 

ggc cga gcg ggt ggc 
gly arg ala gly gly 
361/121 

att ggc cag get gac 
ile gly gin ala asp 
421/141 

aeg gea ggc aag get 
thr ala gly lys ala 
481/161 

gcg cac caa tgt cga 
ala his gin cys arg 
541/181 

gea gga gat ggt cga 
ala gly asp gly arg 
601/201 

ggt agg cga ttt ggt 
gly arg arg phe gly 
661/221 

cga cat ggc tga cgt 
arg his gly OPA arg 
721/241 

cct ttt cga cgt cga 
pro phe arg arg arg 
781/261 

gat ggc get taa cct 
asp gly ala OCH pro 
841/281 

tgt cag get gag cea 
cys gin ala glu pro 
901/301 

ggg cat tec cgt gea 
gly his ser arg ala 
961/321 

acg ggc gtt gcc ggg 
thr gly val ala gly 
1021/341 

egg egg att get gcg 
arg arg ile ala ala 



aca gee ggg cea gac 
thr ala gly pro asp 

gtt cat gtc gcg gcg 
val his val ala ala 

tag ttc get get gat 
AMB phe ala ala asp 

ttg ggt get att gat 
leu gly ala ile asp 

ggt cac ccg ggc egg 
gly his pro gly arg 

gcg aac cga cga cct 
ala asn arg arg pro 

aga ggc att caa ttt 
arg gly ile gin phe 

ggt tac cga cgc egg 
gly tyr arg arg arg 

act ctt gat ggc etc 
thr leu asp gly leu 

eet gcg tge cga tgt 
pro ala cys arg cys 

gga cct gtc ccg agg 
gly pro val pro arg 

cgt cga ccg cag cct 
arg arg pro gin pro 

ggt gat tgg gtg gea 
gly asp trp val ala 

gat gga caa cgc cgc 
asp gly gin arg arg 

get cga cgc gtc gea 
ala arg arg val ala 

gga gcg ccg tct ggt 
gly ala pro ser gly 

ttc ggg cct egg gtt 
phe gly pro arg val 

cat cga aga cac cga 
his arg arg his arg 



31/11 

get gcc ggt egg tge 
ala ala gly arg cys 
91/31 

cac cac cgc cga cea 
his his arg arg pro 
151/51 

etc caa aag tct caa 
leu gin lys ser gin 
211/71 

cgt ggg tgg gat egg 
arg gly trp asp arg 
271/91 

get gag gee ggt ggg 
ala glu ala gly gly 
331/111 

gcg gcc cat ccc cgt 
ala ala his pro arg 
391/131 

aat get gcg ggc get 
asn ala ala gly ala 
451/151 

aca tga att gcg tac 
thr OPA ile ala tyr 
511/171 

gat ggc ccc ggg ggc 
asp gly pro gly gly 
571/191 

get ggc tea aat cga 
ala gly ser asn arg 
631/211 

cga cgc egg aga agt 
arg arg arg arg ser 
691/231 

gga gcg ggt cag gcg 
gly ala gly gin ala 
751/251 

ggt tta tgg cga tac 
gly leu trp arg tyr 
811/271 

gaa gtg gag ccc gcc 
glu val glu pro ala 
871/291 

cgc tga get ggt ggt 
arg OPA ala gly gly 
931/311 

gtt tga acg gtt tta 
val OPA thr val leu 
991/331 

ggc gat cgt caa aca 
gly asp arg gin thr 
1051/351 

ccc agg egg cea gcc 
pro arg arg pro ala 



tge cga gaa ggc ggt 
cys arg glu gly gly 

s^cg ggt get tge cat 
thr gly ala cys his 

gee cac cga age agt 
ala his arg ser ser 

ggt ggc ggt cgc cgc 
gly gly gly arg arg 

ccg cct cac cga age 
pro pro his arg ser 

ctt egg cag cga cga 
leu arg gin arg arg 

ggc cga gtc aeg gga 
gly arg val thr gly 

ccc get aac gtc get 
pro ala asn val ala 

tec gcg get ace caa 
ser ala ala thr gin 

gga att gtc cac act 
gly ile val his thr 

ggt gea cga gee ggt 
gly ala arg ala gly 

gcg gcg caa cga tat 
ala ala gin arg tyr 

cgc tgg att gtc gcg 
arg trp ile val ala 

ggg egg eca cgt ggg 
gly arg pro arg gly 

ttc cga ccg egg ccc 
phe arg pro arg pro 

ccg gtc ggc ate ggc 
pro val gly ile gly 

ggt ggt get caa eca 
gly gly ala gin pro 

ccc tgg aac gtc gat 
pro trp asn val asp 



SEQ ID N0.49B 
FIGURE 49B 
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1081/361 

tta cgt get get ccc 
leu arg ala ala pro 
1141/381 

cgc teg, gag cac gga 
arg ser glu his gly 
1201/401 

tea gtc cac gcg cgc 
ser val his ala arg 
1261/421 

cac gca tgg cca agt 
his ala trp pro ser 
1321/441 

tga cga ate acc cac 
OPA arg ile thr his 
1381/461 
^99 99^ 

arg gly ser ser lys 
1441/481 

cgc age caa ccc agt 
arg ser gin pro ser 
1501/501 

tac etg gcg tga ttc 
tyr leu ala OPA phe 
1561/521 

cag gca tgt tgg cca 
gin ala cys trp pro 
1621/541 

egg ccg cat ccc tgg 
arg pro his pro trp 
1681/561 

etg cca gcg egg cgc 
leu pro ala arg arg 
1741/581 

tgg egg cca agg tgg 
trp arg pro arg trp 
1801/601 

agg agg get ccg gca 
arg arg ala pro ala 
1861/621 

teg egg egg ccg cca 
ser arg arg pro pro 



egg ccg teg gat gee 
arg pro ser asp ala 

cat cga gaa etc teg 
his arg glu leu ser 

aac eta gtt gtg cag 
asn leu val val gin 

tgg ccc gag tag tgg 
trp pro glu AMB trp 

ggt att cgc cac cgc 
gly ile arg his arg 

cgt aca gee age agt 
arg thr ala ser ser 

ace gtc aac ect aeg 
thr val asn pro thr 

cga cca tga cgc ccc 
arg pro OPA arg pro 

teg gcg egg tga cga 
ser ala arg OPA arg 

teg ggt tea acc ggg 
ser gly ser thr gly 

caa gca tec ccg cag 
gin ala ser pro gin 

tgc cca gtg teg tea 
cys pro val ser ser 

tea ttc tgt etg ccg 
ser phe cys leu pro 

age etc ccc tgg gca 
ser leu pro trp ala 



1111/371 

gat tec gca get tee 
asp ser ala ala ser 
1171/391 

ggg ttc ggc gaa cgt 
gly phe gly glu arg 
1231/411 

tta etg ttg aaa gee 
leu leu leu lys ala 
1291/431 

gee tag tac agg aag 
ala AMB tyr arg lys 
1351/451 

cgc age age egg gaa 
arg ser ser arg glu 
1411/471 

teg act ggc gtt acc 
ser thr gly val thr 
1471/491 

agg cgt tgg gtg gta 
arg arg trp val val 
1531/511 

etc etg gga tgg ttc 
leu leu gly trp phe 
1591/531 

tag egg tgg tgt ccg 
AMB arg trp cys pro 
1651/551 

cac ccg ccg gcc cca 
his pro pro ala pro 
1711/571 

caa aca tgc cgc egg 
gin thr cys arg arg 
1771/591 

tgt tgg aaa ccg ate 
cys trp lys pro ile 
1831/611 

agg ggc tga tet tga 
arg gly OPA ser OPA 
1891/631 

gtc cgc cgc cga aaa 
val arg arg arg lys 



egg tgc gae ggc tgg 
arg cys asp gly trp 

tat etc agt gga ate 
tyr leu ser gly ile 

aca ccc atg cca gtc 
thr pro met pro val 

age aac eta gcg aca 
ser asn leu ala thr 

ccc cag gtt atg etc 
pro gin val met leu 

cac cgt ccc. cgc ccc 
his arg pro arg pro 

ccc ggc egg gtc tga 
pro gly arg val OPA 

gcc aac gcc etc gtg 
ala asn ala leu val 

ccg gca teg gcg gcg 
pro ala ser ala ala 

gcg gcg gcc cag tgg 
ala ala ala gin trp 

ggt egg teg aac agg 
gly arg ser asn arg 

tgg gcc gcc agt egg 
trp ala ala ser arg 

cca aca acc acg tga 
pro thr thr thr OPA 

cga egg ta 
arg arg 
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1/1 






ate tac 


acc 


get 


ile tyr 


thr 


ala 


61/21 






ate cgt 


ggc 


gag 


ile arg 


gly 


glu 


121/41 






cgt ctg 


acc 


aac 


arg leu 


thr 


asn 


181/61 






atg aac 


aag 


ctg 


met asn 


lys 


leu 


241/81 






gtg gcc 


ggg 


ggg 


val ala 


gly 


gly 


301/101 






gcc gag 


egg 


gtg 


ala glu 


arg 


val 


361/121 






ttg gcc 


agg 


ctg 


leu ala 


arg 


leu 


421/141 






egg cag 


gca 


agg 


arg gin 


ala 


arg 


481/161 






cgc acc 


aat 


gtc 


arg thr 


asn 


val 


541/181 






cag gag 


atg 


gtc 


gin glu 


met 


val 


601/201 






gta ggc 


gat 


ttg 


val gly 


asp 


leu 


661/221 






gac atg 


get 


gac 


asp met 


ala 


asp 


721/241 






ctt ttc 


gac 


gtc 


leu phe 


asp 


val 


781/261 






atg gcg 


ctt 


aac 


met ala 


leu 


asn 


841/281 






gtc agg 


ctg 


age 


val arg 


leu 


ser 


901/301 






ggc att 


ccc 


gtg 


gly ile 


pro 


val 


961/321 






egg gcg 


ttg 


ccg 


arg ala 


leu 


pro 



caa cag ccg ggc 
gin gin pro gly 

ttg ttc atg teg 
leu phe met ser 

ggt agt teg ctg 
gly ser ser leu 

cgt tgg gtg eta 
arg trp val leu 

atg gtc acc egg 
met val thr arg 

gcg cga acc gac 
ala arg thr asp 

aca gag gca ttc 
thr glu ala phe 

ctg gtt acc gac 
leu val thr asp 

gaa etc ttg atg 
glu leu leu met 

gac ctg cgt gcc 
asp leu arg ala 

gtg gac ctg tec 
val asp leu ser 

gtc gtc gac cgc 
val val asp arg 

gag gtg att ggg 
glu val ile gly 

ctg atg gac aac 
leu met asp asn 

cag etc gac gcg 
gin leu asp ala 

cag gag cgc cgt 
gin glu arg arg 

ggt teg ggc etc 
gly ser gly leu 







31/11 


cag 


a eg 


ctg ccg 


gin 


thr 


leu pro 






91/31 


egg 


cgc 


acc ace 


arg 


arg 


thr thr 






151/51 


ctg 


ate 


tec aaa 


leu 


ile 


ser lys 






211/71 


ttg 


ate 


gtg ggt 


leu 


ile 


val gly 






271/91 


gee 


ggg 


ctg agg 


ala 


gly 


leu arg 






331/111 


gac 


ctg 


egg ccc 


asp 


leu 


arg pro 






391/131 


aat 


tta 


atg ctg 


asn 


leu 


met leu 






451/151 


gcc 


gga 


cat gaa 


ala 


gly 


his glu 






511/171 


gcc 


teg 


atg gcc 


ala 


ser 


met ala 






571/191 


gat 


gtg 


ctg get 


asp 


val 


leu ala 






631/211 


cga 


ggc 


gac gcc 


arg 


gly 


asp ala 






691/231 


age 


ctg 


gag egg 


ser 


leu 


glu arg 






751/251 


tgg 


cag 


gtt tat 


trp gin 


val tyr 






811/271 


gcc 


gcg 


aag tgg 


ala 


ala 


lys trp 






871/291 


teg 


cac 


get gag 


ser 


his 


ala glu 






931/311 


ctg 


gtg 


ttt gaa 


leu 


val 


phe glu 






991/331 


ggg 


ttg 


gcg ate 


gly 


leu 


ala ile 



gtc ggt get gcc 
val gly ala ala 

gcc gac caa egg 
ala asp gin arg 

agt etc aag ccc 
ser leu lys pro 

ggg ate ggg gtg 
gly ile gly val 

ccg gtg ggc cgc 
pro val gly arg 

ate ccc gtc ttc 
ile pro val phe 

egg gcg ctg gcc 
arg ala leu ala 

ttg cgt acc ccg 
leu arg thr pro 

ccg ggg get ccg 
pro gly ala pro 

caa ate gag gaa 
gin ile glu glu 

gga gaa gtg gtg 
gly glu val val 

gtc agg egg egg 
val arg arg arg 

ggc gat acc get 
gly asp thr ala 

age ccg ccg ggc 
ser pro pro gly 

ctg gtg gtt tec 
leu val val ser 

egg ttt tac egg 
arg phe tyr arg 

gtc aaa cag gtg 
val lys gin val 



gag aag gcg gtg 
glu lys ala val 

gtg ctt gcc ate 
val leu ala ile 

ace gaa gca gtc 
thr glu ala val 

gcg gtc gcc gcg 
ala val ala ala 

etc ace gaa gcg 
leu thr glu ala 

ggc age gac gaa 
gly ser asp glu 

gag tea egg gaa 
glu ser arg glu 

eta acg teg ctg 
leu thr ser leu 

egg eta ccc aag 
arg leu pro lys 

ttg tec aca ctg 
leu ser thr leu 

cac gag ccg gtc 
his glu pro val 

cgc aac gat ate 
arg asn asp ile 

gga ttg teg egg 
gly leu ser arg 

ggc cac gtg ggt 
gly his val gly 

gac cgc ggc ccg 
asp arg gly pro 

teg gca teg gca 
ser ala ser ala 

gtg etc aac cac 
val leu asn his 



SEQ ID N0.49C 
FIGURE 49C 
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1021/341 

ggc gga ttg ctg cgc 
gly gly leu leu arg 
1081/361 

tac gtg ctg etc ccc 
tyr val leu leu pro 
1141/381 

get egg age acg gac 
ala arg ser thr asp 
1201/401 

cag tec acg cgc gca 
gin ser thr arg ala 
1261/421 

aeg cat ggc caa gtt 
thr his gly gin val 
1321/441 

gac gaa tea ece acg 
asp glu ser pro thr 
1381/461 

ggg gca gca gca aac 
gly ala ala ala asn 
1441/481 

gca gcc aac cea gta 
ala ala asn pro val 
1501/501 

ace tgg cgt gat tec 
thr trp arg asp ser 
1561/521 

agg cat gtt ggc cat 
arg his val gly his 
1621/541 

ggc cgc ate ect ggt 
gly arg ile pro gly 
1681/561 

tgc eag cgc ggc gcc 
cys gin arg gly ala 
1741/581 

ggc ggc caa ggt ggt 
gly gly gin gly gly 
1801/601 

gga ggg etc egg eat 
gly gly leu arg his 
1861/621 

cgc ggc ggc cgc caa 
arg gly gly arg gin 



ate gaa gac ace gac 
ile glu asp thr asp 

ggc cgt egg atg ccg 
gly arg arg met pro 

ate gag aac tet egg 
ile glu asn ser arg 

acc tag ttg tgc agt 
thr AMB leu cys ser 

ggc ccg agt agt ggg 
gly pro ser ser gly 

gta ttc gee acc gee 
val phe ala thr ala 

gta cag cea gca gtt 
val gin pro ala val 

ccg tea ace eta cga 
pro ser thr leu arg 

gac cat gae gcc ccc 
asp his asp ala pro 

egg cgc ggt gac gat 
arg arg gly asp asp 

egg gtt caa ccg ggc 
arg val gin pro gly 

aag eat ccc cgc age 
lys his pro arg ser 

gcc cag tgt cgt eat 
ala gin cys arg his 

cat tet gtc tgc cga 
his ser val cys arg 

gcc tec cct ggg cag 
ala ser pro gly gin 



1051/351 

cea ggc ggc cag ccc 
pro gly gly gin pro 
1111/371 

att ccg cag ctt ccc 
ile pro gin leu pro 
1171/391 

ggt teg gcg aac gtt 
gly ser ala asn val 
1231/411 

tac tgt tga aag cea 
tyr cys OPA lys pro 
1291/431 

cct agt aca gga aga 
pro ser thr gly arg 
1351/451 

gca gca gee ggg aac 
ala ala ala gly asn 
1411/471 

cga ctg gcg tta ccc 
arg leu ala leu pro 
1471/491 

ggc gtt ggg tgg tac 
gly val gly trp tyr 
1531/511 

tec tgg gat ggt teg 
ser trp asp gly ser 
1591/531 

age ggt ggt gtc cgc 
ser gly gly val arg 
1651/551 

ace cgc egg ccc cag 
thr arg arg pro gin 
1711/571 

aaa eat gcc gcc ggg 
lys his ala ala gly 
1771/591 

gtt gga aac cga tet 
val gly asn arg ser 
1831/611 

ggg get gat ctt gae 
gly ala asp leu asp 
1891/631 

tec gcc gee gaa aac 
ser ala ala glu asn 



cct gga acg teg att 
pro gly thr ser ile 

ggt gcg acg get ggc 
gly ala thr ala gly 

ate tea gtg gaa tet 
ile ser val glu ser 

eac cea tgc eag tec 
his pro cys gin ser 

gca acc tag cga cat 
ala thr AMB arg his 

ccc agg tta tgc tea 
pro arg leu cys ser 

acc gtc ccc gcc ccc 
thr val pro ala pro 

ccg gcc ggg tet gat 
pro ala gly ser asp 

eca acg ccc teg tgc 
pro thr pro ser cys 

egg eat egg egg cgc 
arg his arg arg arg 

egg egg ccc agt ggc 
arg arg pro ser gly 

gtc ggt cga aca ggt 
val gly arg thr gly 

ggg ccg eca gtc gga 
gly pro pro val gly 

caa caa eca cgt gat 
gin gin pro arg asp 

gae ggt a 
asp gly 
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Coding sequence Rv0983 predicted by Cole et al,, 1998 (Nature 393:537-544) and 
containing seq60A: 

1/1 31/11 

atg gcc aag ttg gcc cga gta gtg ggc eta gta cag gaa gag caa cot age gac atg aeg 
Met ala lys leu ala arg val val gly leu val gin glu glu gin pro ser asp met thr 
61/21 91/31 

aat cac oca egg tat teg cca cog cog cag cag ccg gga acc cca ggt tat get cag ggg 
asn his pro arg tyr ser pro pro pro gin gin pro gly thr pro gly tyr ala gin gly 
121/41 151/51 

cag cag caa aeg tac age cag cag ttc gac tgg cgt tac cca ecg tec ccg ecc ccg cag 
gin gin gin thr tyr ser gin gin phe asp trp arg tyr pro pro ser pro pro pro gin 
181/61 211/71 

cca ace cag tac egt caa ecc tac gag gcg ttg ggt ggt acc egg ccg ggt ctg ata ect 
pro thr gin tyr arg gin pro tyr glu ala leu gly gly thr arg pro gly leu ile pro 
241/81 271/91 

ggc gtg att ccg acc atg aeg ecc cct cct ggg atg gtt cge caa ege cct cgt gea ggc 
gly val ile pro thr met thr pro pro pro gly met val arg gin arg pro arg ala gly 
301/101 331/111 

atg ttg gcc ate ggc gcg gtg aeg ata gcg gtg gtg tec gee ggc ate ggc ggc gcg gee 
met leu ala ile gly ala val thr ile ala val val ser ala gly ile gly gly ala ala 
361/121 391/131 

gea tec ctg gtc ggg ttc aac egg gea ecc gcc ggc ecc age ggc ggc cca gtg get gee 
ala ser leu val gly phe asn arg ala pro ala gly pro ser gly gly pro val ala ala 
421/141 451/151 

age gcg gcg cca age ate ecc gea gea aac atg ecg ccg ggg teg gtc gaa cag gtg gcg 
ser ala ala pro ser ile pro ala ala asn met pro pro gly ser val glu gin val ala 
481/161 511/171 

gcc aag gtg gtg ecc agt gtc gtc atg ttg gaa acc gat ctg ggc cge cag teg gag gag 
ala lys val val pro ser val val met leu glu thr asp leu gly arg gin ser glu glu 
541/181 571/191 

ggc tec ggc ate att ctg tet gcc gag ggg ctg ate ttg ace aac aac cac gtg ate gcg 
gly ser gly ile ile leu ser ala glu gly leu ile leu thr asn asn his val ile ala 
601/201 631/211 

gcg gee gcc aag cct ecc ctg ggc agt ccg ecg ecg aaa aeg aeg gta acc ttc tet gac 
ala ala ala lys pro pro leu gly ser pro pro pro lys thr thr val thr phe ser asp 
661/221 691/231 

ggg egg acc gea ecc ttc aeg gtg gtg ggg get gac ecc acc agt gat ate gcc gtc gtc 
gly arg thr ala pro phe thr val val gly ala asp pro thr ser asp ile ala val val 
721/241 751/251 

cgt gtt cag ggc gtc tee ggg etc acc ecg ate tec ctg ggt tec tec teg gac ctg agg 
arg val gin gly val ser gly leu thr pro lie ser leu gly ser ser ser asp leu arg 
781/261 811/271 

gtc ggt cag ccg gtg ctg gcg ate ggg teg ccg etc ggt ttg gag ggc acc gtg acc aeg 
val gly gin pro val leu ala ile gly ser pro leu gly leu glu gly thr val thr thr 
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841/281 








aaa ate qtc 


age 


get 


etc 


gly ile val 


ser 


ala 


leu 


901/301 








acc gtg ctg 


gac 


gcc 


att 


thr val leu 


asp 


ala 


ile 


961/321 








ctg gtg aac 


atg 


aac 


get 


leu val asn 


met 


asn 


ala 


1021/341 








gac tea gcc 


gat 


gcg 


cag 


asp ser ala 


asp 


ala 


gin 


1081/361 








gcc aag cgc 


ate 


gcc 


gac 


ala lys arg 


ile 


ala 


asp 


1141/381 








atg caa atg 


acc 


aat 


gac 


val gin val 


thr 


asn 


asp 


1201/401 








ggt get gcc 


gcg 


aac 


get 


gly ala ala 


ala 


asn 


ala 


1261/421 








ccg ate aac 


age 


gcg 


gac 


pro ile asn 


ser 


ala 


asp 


1321/441 








gtg gcg eta 


acc 


ttt 


cag 


val ala leu 


thr 


phe 


gin 


1381/461 








aag gcg gag 


cag 


tga 




lys ala glu 


gin 


OPA 





871/291 
aac cgt eca gtg teg aeg ace 
asn arg pro val ser thr thr 

931/311 
cag acc gac gcc gcg ate aac 
gin thr asp ala ala ile asn 

991/331 
caa etc gtc gga gtc aac teg 
gin leu val gly val asn ser 

1051/351 
age ggc teg ate ggt etc ggt 
ser gly ser ile gly leu gly 

1111/371 
gag ttg ate age acc ggc aag 
glu leu ile ser thr gly lys 

1171/391 
aaa gac acc ctg ggc gcc aag 
lys asp thr leu gly ala lys 

1231/411 
gga gtg ccg aag ggc gtc gtt 
gly val pro lys gly val val 

1291/431 
gcg ttg gtt gcc gcc gtg egg 
ala leu val ala ala val arg 

1351/451 
gat ccc teg ggc ggt age cgc 
asp pro ser gly gly ser arg 



ggc gag gcc ggc aac cag aac 
gly glu ala gly asn gin asn 

ccc ggt aac tec ggg ggc gcg 
pro gly asn ser gly gly ala 

gcc att gcc acg ctg ggc gcg 
ala ile ala thr leu gly ala 

ttt gcg att cca gtc gac cag 
phe ala ile pro val asp gin 

gcg tea eat gcc tec ctg ggt 
ala ser his ala ser leu gly 

ate gtc gaa gta gtg gcc ggt 
ile val glu val val ala gly 

gtc acc aag gtc gac gac cgc 
val thr lys val asp asp arg 

tec aaa gcg ccg ggc gcc acg 
ser lys ala pro gly ala thr 

aca gtg caa gtc acc etc ggc 
thr val gin val thr leu gly 



SEQ ID N0.49D (continued 1) 
FIGURE 49D (continued 1) 



11 



174/185 



ORF according to Cole et al., 1998 (Nature 393:537-544) and containing Rv0983 



1/1 



31/11 



tga gcc age teg acg cgt cgc acg ctg age tgg tgg ttt ccg ace gcg gee egg gea ttc 
OPA ala ser ser thr arg arg thr leu ser trp trp phe pro thr ala ala arg ala phe 
61/21 91/31 

ccg tgc agg age gcc gtc tgg tgt ttg aac ggt ttt acc ggt egg cat egg cac ggg cgt 
pro eys arg ser ala val trp cys leu asn gly phe thr gly arg his arg his gly arg 
121/41 151/51 

tgc egg gtt egg gee teg ggt tgg cga teg tea aac agg tgg tgc tea acc acg gcg gat 
CVS arg val arg ala ser gly trp arg ser ser asn arg trp cys ser thr thr ala asp 
181/61 211/71 

tgc tgc gca teg aag aca ccg acc cag gcg gcc age ccc ctg gaa cgt cga ttt acg tgc 
cys cys ala ser lys thr pro thr gin ala ala ser pro leu glu arg arg phe thr cys 
241/81 271/91 

tgc tec ccg gcc gtc gga tgc ega ttc cgc age ttc ccg gtg cga egg ctg gcg etc gga 
cys ser pro ala val gly cys arg phe arg ser phe pro val arg arg leu ala leu gly 
301/101 331/111 

gca egg aca teg aga act etc ggg gtt egg cga acg tta tet cag tgg aat etc agt eca 
ala arg thr ser arg thr leu gly val arg arg thr leu ser gin trp asn leu ser pro 
361/121 391/131 

cgc gcg caa ect agt tgt gca gtt act gtt gaa age eac acc cat gcc agt eca cgc atg 
arg ala gin pro ser cys ala val thr val glu ser his thr his ala ser pro arg met 
421/141 451/151 

gcc aag ttg gcc cga gta gtg ggc eta gta cag gaa gag caa ect age gae atg acg aat 
ala lys leu ala arg val val gly leu val gin glu glu gin pro ser asp met thr asn 
481/161 511/171 

cac eca egg tat teg cea ccg ceg cag cag ccg gga ace cea ggt tat get cag ggg cag 
his pro arg tyr ser pro pro pro gin gin pro gly thr pro gly tyr ala gin gly gin 
541/181 571/191 

cag caa acg tac age eag cag ttc gac tgg cgt tac cea ccg tec ccg cec ceg cag cea 
gin gin thr tyr ser gin gin phe asp trp arg tyr pro pro ser pro pro pro gin pro 
601/201 631/211 

acc eag tac cgt caa ccc tac gag gcg ttg ggt ggt acc egg ccg ggt ctg ata ect ggc 
thr gin tyr arg gin pro tyr glu ala leu gly gly thr arg pro gly leu lie pro gly 
661/221 691/231 

gtg att ceg ace atg acg ccc cet cet ggg atg gtt cgc caa cgc ect cgt gca ggc atg 
val ile pro thr met thr pro pro pro gly met val arg gin arg pro arg ala gly met 
721/241 751/251 

ttg gee ate ggc gcg gtg acg ata gcg gtg gtg tec gee ggc ate ggc ggc gcg gee gea 
leu ala ile gly ala val thr ile ala val val ser ala gly ile gly gly ala ala ala 
781/261 811/271 

tec ctg gtc ggg ttc aac egg gca ccc gcc ggc ccc age ggc ggc cea gtg get gcc age 
ser leu val gly phe asn arg ala pro ala gly pro ser gly gly pro val ala ala ser 
841/281 871/291 

gcg gcg cea age ate ccc gea gea aac atg ccg ceg ggg teg gtc gaa cag gtg gcg gee 
ala ala pro ser ile pro ala ala asn met pro pro gly ser val glu gin val ala ala 



SEQ ID N0.49F 
FIGURE 49F 



175/185 



901/301 931/311 

aag gtg gtg ccc agt gtc gtc atg ttg gaa acc gat ctg ggc cgc cag teg gag gag ggc 
lys val val pro ser val val met leu glu thr asp leu gly arg gin ser glu glu gly 
961/321 991/331 

tec ggc ate att ctg tet gee gag ggg ctg ate ttg acc aae aac cac gtg ate gcg geg 
ser gly ile lie leu ser ala glu gly leu lie leu thr asn asn his val lie ala ala 
1021/341 1051/351 

gee gcc aag cct ccc ctg ggc agt ccg ccg ccg aaa acg acg gta acc ttc tct gac ggg 
ala ala lys pro pro leu gly ser pro pro pro lys thr thr val thr phe ser asp gly 
1081/361 1111/371 

egg acc gca ccc ttc acg gtg gtg ggg get gac ccc ace agt gat ate gee gtc gtc cgt 
arg thr ala pro phe thr val val gly ala asp pro thr ser asp ile ala val val arg 
1141/381 1171/391 

gtt cag ggc gtc tec ggg etc ace ccg ate tec ctg ggt tec tee teg gac ctg agg gtc 
val gin gly val ser gly leu thr pro ile ser leu gly ser ser ser asp leu arg val 
1201/401 1231/411 

ggt cag ccg gtg ctg gcg ate ggg teg ccg etc ggt ttg gag ggc acc gtg acc acg ggg 
gly gin pro val leu ala ile gly ser pro leu gly leu glu gly thr val thr thr gly 
1261/421 1291/431 

ate gtc age get etc aac cgt cea gtg teg acg acc ggc gag gee ggc aac cag aac acc 
ile val ser ala leu asn arg pro val ser thr thr gly glu ala gly asn gin asn thr 
1321/441 1351/451 

gtg ctg gac gcc att cag acc gac gee geg ate aac ccc ggt aac tec ggg ggc gcg ctg 
val leu asp ala ile gin thr asp ala ala ile asn pro gly asn ser gly gly ala leu 
1381/461 1411/471 

gtg aac atg aac get caa etc gtc gga gtc aac teg gcc att gcc acg ctg ggc gcg gac 
val asn met asn ala gin leu val gly val asn ser ala ile ala thr leu gly ala asp 
1441/481 1471/491 

tea gcc gat gcg cag age ggc teg ate ggt etc ggt ttt gcg att cea gtc gac cag gcc 
ser ala asp ala gin ser gly ser ile gly leu gly phe ala ile pro val asp gin ala 
1501/501 1531/511 

aag cgc ate gee gac gag ttg ate age ace ggc aag geg tea eat gcc tec ctg ggt gtg 
lys arg ile ala asp glu leu ile ser thr gly lys ala ser his ala ser leu gly val 
1561/521 1591/531 

cag gtg ace aat gac aaa gac acc ctg ggc gee aag ate gtc gaa gta gtg gee ggt ggt 
gin val thr asn asp lys asp thr leu gly ala lys ile val glu val val ala gly gly 
1621/541 1651/551 

get gcc gcg aac get gga gtg ccg aag ggc gtc gtt gtc ace aag gtc gac gac cgc ccg 
ala ala ala asn ala gly val pro lys gly val val val thr lys val asp asp arg pro 
1681/561 1711/571 

ate aac age gcg gac gcg ttg gtt gcc gcc gtg egg tec aaa gcg ccg ggc gcc acg gtg 
ile asn ser ala asp ala leu val ala ala val arg ser lys ala pro gly ala thr val 
1741/581 1771/591 

gcg eta acc ttt cag gat ccc teg ggc ggt age cgc aca gtg caa gtc acc etc ggc aag 

ala leu thr phe gin asp pro ser gly gly ser arg thr val gin val thr leu gly lys 

1801/601 

geg gag cag tga 

ala glu gin OPA 



SEQ ID N0.49F (continued 1) 
FIGURE 49F (continued 1) 
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Fragment annplified by PGR based on the sequence similarities with a serine protease 
the E.coli HtrA family (creation of an SnaBI site at the 3' end) and subcloned into 
vector pJVEDa: 

1/1 31/11 

gat ccg gcg ggg egg gtg teg gcg cag gcg tgg ctg gcg gtc acg gcg gtg egg gcg gtg 
asp pro ala gly arg val ser ala gin ala trp leu ala val thr ala val arg ala val 
61/21 91/31 

ccg ccg ggc tgt ggg gcg ccg gcg gcg gcg gtg gca atg gcg gga acg gcg ccg atg cca 
pro pro gly cys gly ala pro ala ala ala val ala met ala gly thr ala pro met pro 
121/41 151/51 

aca teg tea gcg gtg gag acg gtg gee teg gcg gtg ccg gtg gcg gtg gcg gat ggc tct 
thr ser ser ala val glu thr val ala ser ala val pro val ala val ala asp gly ser 
181/61 211/71 

acg gcg acg gcg ggg ccg gcg gac acg gcg gae aag gcg caa teg gee teg gcg gcg gcg 
thr ala thr ala gly pro ala asp thr ala asp lys ala gin ser ala ser ala ala ala 
241/81 271/91 

ccg gcg gcg acg ggg gcc agg gcg gcg ccg gee gcg gac tgt ggg gta ctg gcg gcg ccg 
pro ala ala thr gly ala arg ala ala pro ala ala asp cys gly val leu ala ala pro 
301/101 331/111 

gcg gac acg gcg ggc aag gcg gtg gta ccg ggg gcc cac cgc tgc ccg gtc agg cag gca 
ala asp thr ala gly lys ala val val pro gly ala his arg cys pro val arg gin ala 
361/121 391/131 

tgg gcg ccg egg gtg gcg ccg gtg ggc tga teg gca acg gcg ggg ccg gcg gcg acg gcg 
trp ala pro arg val ala pro val gly OPA ser ala thr ala gly pro ala ala thr ala 
421/141 451/151 

gtg teg gcg cgt ccg gcg ggg teg ccg gag tag gcg gtg ccg gcg gga acg cca tgc tga 
val ser ala arg pro ala gly ser pro glu AMB ala val pro ala gly thr pro cys OPA 
481/161 511/171 

teg ggc acg gcg gcg ccg gcg gcg ccg gcg gag aca gca gtt teg eta atg gcg egg ccg 
ser gly thr ala ala pro ala ala pro ala glu thr ala val ser leu met ala arg pro 
541/181 571/191 

gcg gcg egg gcg gtg ccg gag ggc ace tct teg gca atg gcg ggt ccg gcg gcc acg gcg 
ala ala arg ala val pro glu gly thr ser ser ala met ala gly pro ala ala thr ala 
601/201 631/211 

gag ccg tea egg ccg gca aca ccg gta teg gtg gcg ccg gcg gcg teg gtg ggg acg cca 
glu pro ser arg pro ala thr pro val ser val ala pro ala ala ser val gly thr pro 
661/221 691/231 

ggc tga teg gcc acg gtg gcg ccg gcg gtg eeg gcg ggg ace gcg ccg gag cet tgg ttg 
gly OPA ser ala thr val ala pro ala val pro ala gly thr ala pro glu pro trp leu 
721/241 751/251 

gee gtg acg gcg ggc ccg gtg gga acg ggg gcg ctg gcg gee age tat acg gca acg gcg 
ala val thr ala gly pro val gly thr gly ala leu ala ala ser tyr thr ala thr ala 
781/261 811/271 

gcg acg gcg cec ccg gca ccg gcg gaa cac tgc agg egg egg tga gcg gat tgg tga egg 
ala thr ala pro pro ala pro ala glu his cys arg arg arg OPA ala asp trp OPA arg 
841/281 871/291 

ctt tgt teg gtg cac ccg gcc aac ccg gcg aca ccg gcc aac ccg get age cec gat caa 
leu cys ser val his pro ala asn pro ala thr pro ala asn pro ala ser pro asp gin 
901/301 931/311 

cga ggg ttt egg tgc egg tec ggg gca tgg cca tec get gag ctg gcg ate tgg act acg 
arg gly phe arg cys arg ser gly ala trp pro ser ala glu leu ala lie trp thr thr 
961/321 991/331 

ttg gtg tag aaa aat cet gcc gcc egg ace ctt aag get ggg aca att tct gat age tac 
leu val AMB lys asn pro ala ala arg thr leu lys ala gly thr ile ser asp ser tyr 
1021/341 1051/351 

cec gac aca gga ggt tac ggg atg age aat teg cgc cgc cgc tea etc agg tgg tea tgg 
pro asp thr gly gly tyr gly met ser asn ser arg arg arg ser leu arg trp ser trp 
1081/361 1111/371 

ttg ctg age gtg ctg get gee gtc ggg ctg ggc ctg gee acg gcg ccg gee cag gcg gcc 
leu leu ser val leu ala ala val gly leu gly leu ala thr ala pro ala gin ala ala 
1141/381 

ccg ccg gcc ttg teg cag gae egg tt 
pro pro ala leu ser gin asp arg 



SEQ ID No, 5 OA 



FIGURE 50A 



177/185 



1/1 31/11 

ate egg egg ggc ggg tgt egg cgc agg cgt ggc tgg egg tea egg egg tge ggg egg tgc 
ile arg arg gly gly eys arg arg arg arg gly trp arg ser arg arg cys gly arg cys 
61/21 91/31 

cgc egg get gtg ggg cgc egg egg egg egg tgg caa tgg egg gaa egg cgc cga tgc caa 
arg arg ala val gly arg arg arg arg arg trp gin trp arg glu arg arg arg cys gin 
121/41 151/51 

cat cgt cag egg tgg aga egg tgg cct egg egg tgc egg tgg egg tgg egg atg get eta 
his arg gin arg trp arg arg trp pro arg arg cys arg trp arg trp arg met ala leu 
181/61 211/71 

egg cga egg egg ggc egg egg aca egg egg aca agg cgc aat egg cct egg egg egg cgc 
arg arg arg arg gly arg arg thr arg arg thr arg arg asn arg pro arg arg arg arg 
241/81 271/91 

egg egg cga egg ggg cca ggg egg cgc egg ccg egg act gtg ggg tac tgg egg cgc egg 
arg arg arg arg gly pro gly arg arg arg pro arg thr val gly tyr trp arg arg arg 
301/101 331/111 

egg aca egg egg gea agg egg tgg tac egg ggg cce acc get gee egg tea ggc agg cat 
arg thr arg arg ala arg arg trp tyr arg gly pro thr ala ala arg ser gly arg his 
361/121 391/131 

ggg cgc cgc ggg tgg cgc egg tgg get gat egg caa egg egg ggc egg egg cga egg egg 
gly arg arg gly trp arg arg trp ala asp arg gin arg arg gly arg arg arg arg arg 
421/141 451/151 

tgt egg cgc gtc egg egg ggt cgc egg agt agg egg tgc egg egg gaa cgc cat get gat 
eys arg arg val arg arg gly arg arg ser arg arg eys arg arg glu arg his ala asp 
481/161 511/171 

egg gea egg egg cgc egg egg cgc egg egg aga cag cag ttt cgc taa tgg cgc ggc egg 
arg ala arg arg arg arg arg arg arg arg arg gin gin phe arg OCH trp arg gly arg 
541/181 571/191 

egg cgc ggg egg tge egg agg gea cct ctt egg caa tgg egg gtc egg egg cca egg egg 
arg arg gly arg cys arg arg ala pro leu arg gin trp arg val arg arg pro arg arg 
601/201 631/211 

age cgt cae ggc egg caa cac egg tat egg tgg cgc egg egg cgt egg tgg gga cgc cag 
ser arg his gly arg gin his arg tyr arg trp arg arg arg arg arg trp gly arg gin 
661/221 691/231 

get gat egg cca egg tgg cgc egg egg tgc egg egg gga ccg cgc egg age ctt ggt tgg 
ala asp arg pro arg trp arg arg arg eys arg arg gly pro arg arg ser leu gly trp 
721/241 751/251 

ccg tga egg egg gee egg tgg gaa egg ggg cgc tgg egg cca get at a egg caa egg egg 
pro OPA arg arg ala arg trp glu arg gly arg trp arg pro ala ile arg gin arg arg 
781/261 811/271 

cga egg cgc cec egg cac egg egg aac act gea ggc ggc ggt gag egg att ggt gac ggc 
arg arg arg pro arg his arg arg asn thr ala gly gly gly glu arg ile gly asp gly 
841/281 871/291 

ttt gtt egg tge acc egg cca acc egg cga cae egg cca ace egg eta gee ccg ate aac 
phe val arg cys thr arg pro thr arg arg his arg pro thr arg leu ala pro ile asn 
901/301 931/311 

gag ggt tte ggt gee ggt ccg ggg cat ggc cat ccg ctg age tgg cga tet gga eta cgt 
glu gly phe gly ala gly pro gly his gly his pro leu ser trp arg ser gly leu arg 
961/321 991/331 

tgg tgt aga aaa ate ctg ccg cec gga cce tta agg ctg gga caa ttt ctg ata get acc 
trp cys arg lys ile leu pro pro gly pro leu arg leu gly gin phe leu ile ala thr 
1021/341 1051/351 

ccg aca cag gag gtt aeg gga tga gea att cgc gee gee get cac tea ggt ggt cat ggt 
pro thr gin glu val thr gly OPA ala ile arg ala ala ala his ser gly gly his gly 
1081/361 1111/371 

tgc tga gcg tge tgg ctg ccg teg ggc tgg gee tgg cca egg cgc egg cec agg egg cce 
cys OPA ala eys trp leu pro ser gly trp ala trp pro arg arg arg pro arg arg pro 
1141/381 

cgc egg cct tgt cgc agg ace ggt t 
arg arg pro eys arg arg thr gly 

SEQ ID NO.50B 
FIGURE 50B 



178/185 



1/1 

tec ggc ggg gcg ggt gtc ggc gca ggc gtg 
ser gly gly ala gly val gly ala gly val 
61/21 

gcc ggg ctg tgg ggc gcc ggc ggc ggc ggt 
ala gly leu trp gly ala gly gly gly gly 
121/41 

ate gtc age ggt gga gac ggt ggc etc ggc 
ile val ser gly gly asp gly gly leu gly 
181/61 

ggc gac ggc ggg gcc ggc gga cac ggc gga 
gly asp gly gly ala gly gly his gly gly 
241/81 

ggc ggc gac ggg ggc cag ggc ggc gcc ggc 
gly gly asp gly gly gin gly gly ala gly 
301/101 

gga cac ggc ggg caa ggc ggt ggt ace ggg 
gly his gly gly gin gly gly gly thr gly 
361/121 

ggc gcc gcg ggt ggc gcc ggt ggg ctg ate 
gly ala ala gly gly ala gly gly leu ile 
421/141 

gtc ggc gcg tec ggc ggg gtc gcc gga gta 
val gly ala ser gly gly val ala gly val 
481/161 

ggg cac ggc ggc gcc ggc ggc gcc ggc gga 
gly his gly gly ala gly gly ala gly gly 
541/181 

ggc gcg ggc ggt gcc gga ggg cac etc ttc 
gly ala gly gly ala gly gly his leu phe 
601/201 

gcc gtc acg gcc ggc aac acc ggt ate ggt 
ala val thr ala gly asn thr gly ile gly 
661/221 

ctg ate ggc cac ggt ggc gcc ggc ggt gcc 
leu ile gly his gly gly ala gly gly ala 
721/241 

cgt gac ggc ggg ccc ggt ggg aac ggg ggc 
arg asp gly gly pro gly gly asn gly gly 
781/261 

gac ggc gcc ccc ggc acc ggc gga aca ctg 
asp gly ala pro gly thr gly gly thr leu 
841/281 

ttg ttc ggt gca ccc ggc caa ccc ggc gac 
leu phe gly ala pro gly gin pro gly asp 
901/301 

agg gtt teg gtg ceg gtc egg ggc atg gcc 
arg val ser val pro val arg gly met ala 
961/321 

ggt gta gaa aaa tec tgc cgc ceg gac ect 
gly val glu lys ser cys arg pro asp pro 
1021/341 

cga cac agg agg tta egg gat gag caa ttc 
arg his arg arg leu arg asp glu gin phe 
1081/361 

get gag cgt get ggc tgc cgt egg get ggg 
ala glu arg ala gly cys arg arg ala gly 
1141/381 

gcc ggc ctt gtc gca gga ceg gtt 
ala gly leu val ala gly pro val 



31/11 

get ggc ggt cac ggc ggt gcg ggc ggt gcc 
ala gly gly his gly gly ala gly gly ala 
91/31 

ggc aat ggc ggg aac ggc gcc gat gcc aac 
gly asn gly gly asn gly ala asp ala asn 
151/51 

ggt gcc ggt ggc ggt ggc gga tgg etc tac 
gly ala gly gly gly gly gly trp leu tyr 
211/71 

caa ggc gca ate ggc etc ggc ggc ggc gcc 
gin gly ala ile gly leu gly gly gly ala 
271/91 

cgc gga ctg tgg ggt act ggc ggc gee ggc 
arg gly leu trp gly thr gly gly ala gly 
331/111 

ggc cea ceg ctg ccc ggt cag gca ggc atg 
gly pro pro leu pro gly gin ala gly met 
391/131 

ggc aac ggc ggg gcc ggc ggc gac ggc ggt 
gly asn gly gly ala gly gly asp gly gly 
451/151 

ggc ggt gcc ggc ggg aac gcc atg ctg ate 
gly gly ala gly gly asn ala met leu ile 
511/171 

gac age agt ttc get aat ggc gcg gcc ggc 
asp ser ser phe ala asn gly ala ala gly 
571/191 

ggc aat ggc ggg tec ggc ggc cac ggc gga 
gly asn gly gly ser gly gly his gly gly 
631/211 

ggc gcc ggc ggc gtc ggt ggg gac gcc agg 
gly ala gly gly val gly gly asp ala arg 
691/231 

ggc ggg gac cgc gcc gga gcc ttg gtt ggc 
gly gly asp arg ala gly ala leu val gly 
751/251 

get ggc ggc cag eta tac ggc aac ggc ggc 
ala gly gly gin leu tyr gly asn gly gly 
811/271 

cag gcg gcg gtg age gga ttg gtg acg get 
gin ala ala val ser gly leu val thr ala 
871/291 

acc ggc caa ccc ggc tag ccc cga tea acg 
thr gly gin pro gly AMB pro arg ser thr 
931/311 

ate cgc tga get ggc gat ctg gac tac gtt 
ile arg OPA ala gly asp leu asp tyr val 
991/331 

taa ggc tgg gac aat ttc tga tag eta ccc 
OCH gly trp asp asn phe OPA AMB leu pro 
1051/351 

gcg ceg ceg etc act cag gtg gtc atg gtt 
ala pro pro leu thr gin val val met val 
1111/371 

ect ggc cac ggc gee ggc cca ggc ggc ccc 
pro gly his gly ala gly pro gly gly pro 



SEQ ID No.SOC 
FIGURE 50C 
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Coding sequence Rv0125 predicted by Cole et al , , 1998 (Nature 393:537-544) and 
containing seqSOA: 

1/1 31/11 

atg age aat teg cgc cgc egc tea etc agg tgg tea tgg ttg ctg age gtg ctg get gee 
Met ser asn ser arg arg arg ser leu arg trp ser trp leu leu ser val leu ala ala 
61/21 91/31 

gtc ggg ctg ggc ctg gee aeg gcg ccg gee cag gcg gee ccg ceg gee ttg teg cag gae 
val gly leu gly leu ala thr ala pro ala gin ala ala pro pro ala leu ser gin asp 
121/41 151/51 

egg ttc gee gae ttc ccc gcg ctg ccc etc gac ccg tec gcg atg gtc gcc caa gtg ggg 
arg phe ala asp phe pro ala leu pro leu asp pro ser ala met val ala gin val gly 
181/61 211/71 

cca cag gtg gtc aac ate aac acc aaa ctg ggc tac aac aac gee gtg ggc gee ggg acc 
pro gin val val asn ile asn thr lys leu gly tyr asn asn ala val gly ala gly thr 
241/81 271/91 

ggc ate gtc ate gat ccc aac ggt gtc gtg ctg acc aac aac cac gtg ate gcg ggc gee 
gly ile val ile asp pro asn gly val val leu thr asn asn his val ile ala gly ala 
301/101 331/111 

ace gac ate aat gcg ttc age gtc ggc tec ggc caa acc tac ggc gtc gat gtg gtc ggg 
thr asp ile asn ala phe ser val gly ser gly gin thr tyr gly val asp val val gly 
361/121 391/131 

tat gac cgc acc cag gat gtc gcg gtg ctg cag ctg cgc ggt gee ggt ggc ctg ccg teg 
tyr asp arg thr gin asp val ala val leu gin leu arg gly ala gly gly leu pro ser 
421/141 451/151 

gcg gcg ate ggt ggc ggc gtc gcg gtt ggt gag ccc gtc gtc gcg atg ggc aac age ggt 
ala ala ile gly gly gly val ala val gly glu pro val val ala met gly asn ser gly 
481/161 511/171 

ggg cag ggc gga aeg ccc cgt gcg gtg cct ggc agg gtg gtc gcg etc ggc caa acc gtg 
gly gin gly gly thr pro arg ala val pro gly arg val val ala leu gly gin thr val 
541/181 571/191 

cag gcg teg gat teg ctg acc ggt gee gaa gag aca ttg aac ggg ttg ate cag ttc gat 
gin ala ser asp ser leu thr gly ala glu glu thr leu asn gly leu ile gin phe asp 
601/201 631/211 

gee gcg ate cag ccc ggt gat teg ggc ggg ccc gtc gtc aac ggc eta gga cag gtg gtc 
ala ala ile gin pro gly asp ser gly gly pro val val asn gly leu gly gin val val 
661/221 691/231 

ggt atg aac aeg gcc gcg tec gat aac ttc cag ctg tec cag ggt ggg cag gga ttc gcc 
gly met asn thr ala ala ser asp asn phe gin leu ser gin gly gly gin gly phe ala 
721/241 751/251 

att ccg ate ggg cag gcg atg gcg ate gcg ggc cag ate cga teg ggt ggg ggg tea ccc 
ile pro ile gly gin ala met ala ile ala gly gin ile arg ser gly gly gly ser pro 
781/261 811/271 

acc gtt cat ate ggg cct acc gcc ttc etc ggc ttg ggt gtt gtc gac aac aac ggc aac 
thr val his ile gly pro thr ala phe leu gly leu gly val val asp asn asn gly asn 
841/281 871/291 

ggc gea cga gtc caa cgc gtg gtc ggg age get ccg gcg gca agt etc ggc ate tec acc 
gly ala arg val gin arg val val gly ser ala pro ala ala ser leu gly ile ser thr 
901/301 931/311 

ggc gae gtg ate acc gcg gtc gac ggc get ccg ate aac teg gcc acc gcg atg gcg gac 
gly asp val ile thr ala val asp gly ala pro ile asn ser ala thr ala met ala asp 
961/321 991/331 

gcg ett aac ggg cat cat ccc ggt gac gtc ate teg gtg acc tgg caa acc aag teg ggc 
ala leu asn gly his his pro gly asp val ile ser val thr trp gin thr lys ser gly 
1021/341 1051/351 

ggc aeg cgt aca ggg aac gtg aca ttg gee gag gga ccc ccg gcc tga 
gly thr arg thr gly asn val thr leu ala glu gly pro pro ala OPA 
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ORF according to Cole et al,, 1998 (Nature 393:537-544) and containing Rv0125: 
1/1 31/11 

tag aaa aat cct gcc gcc egg acc ctt aag get ggg aca att tct gat age tac ccc gac 
AMB lys asn pro ala ala arg thr leu lys ala gly thr ile ser asp ser tyr pro asp 
61/21 91/31 

aca gga ggt tac ggg atg age aat teg cgc cgc cgc tea etc agg tgg tea tgg ttg ctg 
thr gly gly tyr gly met ser asn ser arg arg arg ser leu arg trp ser trp leu leu 
121/41 151/51 

age gtg ctg get gcc gtc ggg ctg ggc ctg gee acg gcg ccg gcc cag gcg gcc ccg ccg 
ser val leu ala ala val gly leu gly leu ala thr ala pro ala gin ala ala pro pro 
181/61 211/71 

gcc ttg teg cag gac egg ttc gcc gac ttc ccc gcg ctg ccc etc gac ccg tec gcg atg 
ala leu ser gin asp arg phe ala asp phe pro ala leu pro leu asp pro ser ala met 
241/81 271/91 

gtc gcc caa gtg ggg cca cag gtg gtc aac ate aac acc aaa ctg ggc tac aac aac gcc 
val ala gin val gly pro gin val val asn ile asn thr lys leu gly tyr asn asn ala 
301/101 331/111 

gtg ggc gcc ggg acc ggc ate gtc ate gat ccc aac ggt gtc gtg ctg ace aac aac cae 
val gly ala gly thr gly ile val ile asp pro asn gly val val leu thr asn asn his 
361/121 391/131 

gtg ate gcg ggc gcc acc gac ate aat gcg ttc age gtc ggc tec ggc caa acc tac ggc 
val ile ala gly ala thr asp ile asn ala phe ser val gly ser gly gin thr tyr gly 
421/141 451/151 

gtc gat gtg gtc ggg tat gac cgc acc cag gat gtc gcg gtg ctg cag ctg cgc ggt gcc 
val asp val val gly tyr asp arg thr gin asp val ala val leu gin leu arg gly ala 
481/161 511/171 

ggt ggc ctg ccg teg gcg gcg ate ggt ggc ggc gtc gcg gtt ggt gag ccc gtc gtc gcg 
gly gly leu pro ser ala ala ile gly gly gly val ala val gly glu pro val val ala 
541/181 571/191 

atg ggc aac age ggt ggg cag ggc gga acg ccc cgt gcg gtg cct ggc agg gtg gtc gcg 
met gly asn ser gly gly gin gly gly thr pro arg ala val pro gly arg val val ala 
601/201 631/211 

etc ggc caa acc gtg cag gcg teg gat teg ctg acc ggt gcc gaa gag aca ttg aac ggg 
leu gly gin thr val gin ala ser asp ser leu thr gly ala glu glu thr leu asn gly 
661/221 691/231 

ttg ate cag ttc gat gcc gcg ate cag ccc ggt gat teg ggc ggg ccc gtc gtc aac ggc 
leu ile gin phe asp ala ala ile gin pro gly asp ser gly gly pro val val asn gly 
721/241 751/251 

eta gga cag gtg gtc ggt atg aac acg gcc gcg tec gat aac ttc cag ctg tec cag ggt 
leu gly gin val val gly met asn thr ala ala ser asp asn phe gin leu ser gin gly 
781/261 811/271 

ggg cag gga ttc gee att ccg ate ggg cag gcg atg gcg ate gcg ggc cag ate ega teg 
gly gin gly phe ala ile pro ile gly gin ala met ala ile ala gly gin ile arg ser 
841/281 871/291 

ggt ggg ggg tea ccc acc gtt cat ate ggg cct acc gcc ttc etc ggc ttg ggt gtt gtc 
gly gly gly ser pro thr val his ile gly pro thr ala phe leu gly leu gly val val 
901/301 931/311 

gac aac aac ggc aac ggc gca cga gtc caa cgc gtg gtc ggg age get ccg gcg gea agt 
asp asn asn gly asn gly ala arg val gin arg val val gly ser ala pro ala ala ser 
961/321 991/331 

etc ggc ate tec acc ggc gac gtg ate acc gcg gtc gac ggc get ccg ate aac teg gcc 
leu gly ile ser thr gly asp val ile thr ala val asp gly ala pro ile asn ser ala 
1021/341 1051/351 

acc gcg atg gcg gac gcg ctt aac ggg cat cat ccc ggt gac gtc ate teg gtg acc tgg 
thr ala met ala asp ala leu asn gly his his pro gly asp val ile ser val thr trp 
1081/361 1111/371 

caa ace aag teg ggc ggc acg cgt aca ggg aac gtg aca ttg gcc gag gga ccc ccg gee 

gin thr lys ser gly gly thr arg thr gly asn val thr leu ala glu gly pro pro ala 

1141/381 

tga 

OPA 
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